Abstract 

The main purpose of this survey is to introduce the inexperienced reader to additive 
prime number theory and some related branches of analytic number theory. We state 
the main problems in the field, sketch their history and the basic machinery used to 
study them, and try to give a representative sample of the directions of current research. 
1 Introduction 

Additive number theory is the branch of number theory that studies the representations of 
natural numbers as sums of integers subject to various arithmetic restrictions. For example, 
given a sequence of integers 

$A = \{a_1 < a_2 < a_3 < \cdots\}$

one often asks what natural numbers can be represented as sums of a fixed number of 
elements of $A$; that is, for any fixed $s \in \mathbb{N}$, one wants to find the natural numbers $n$ such 
that the diophantine equation 

(1.1) $x_1 + \cdots + x_s = n$

has a solution in $x_1, \dots, x_s \in A$. The sequence $A$ may be described in some generality (say, 
one may assume that $A$ contains "many" integers), or it may be a particular sequence of 
some arithmetic interest (say, $A$ may be the sequence of $k$th powers, the sequence of prime 
numbers, the values taken by a polynomial $F(X) \in \mathbb{Z}[X]$ at the positive integers or at the 
primes, etc.). In this survey, we discuss almost exclusively problems of the latter kind. The 
main focus will be on two questions, known as Goldbach's problem and the Waring-Goldbach 
problem, which are concerned with representations as sums of primes and powers of primes, 
respectively. 

1.1 Goldbach's problem 

Goldbach's problem appeared for the first time in 1742 in the correspondence between Gold- 
bach and Euler. In modern language, it can be stated as follows. 
Goldbach's Conjecture. Every even integer $n \ge 4$ is the sum of two primes, and every 
odd integer $n \ge 7$ is the sum of three primes. 

The two parts of this conjecture are known as the binary Goldbach problem and the 
ternary Goldbach problem, respectively. Clearly, the binary conjecture is the stronger one. 
It is also much more difficult. 
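
Both parts of the conjecture are easy to test numerically for small $n$. The following sketch is only an illustration (the helper names `primes_up_to` and `goldbach_exceptions` are ours and do not come from the literature): it lists the even $n \ge 4$ up to a given bound that are not sums of two primes, and the odd $n \ge 7$ that are not sums of three primes.

```python
from math import isqrt


def primes_up_to(limit):
    """Sieve of Eratosthenes: list of all primes p <= limit."""
    if limit < 2:
        return []
    is_prime = [True] * (limit + 1)
    is_prime[0] = is_prime[1] = False
    for i in range(2, isqrt(limit) + 1):
        if is_prime[i]:
            for j in range(i * i, limit + 1, i):
                is_prime[j] = False
    return [i for i, flag in enumerate(is_prime) if flag]


def goldbach_exceptions(limit):
    """Return (binary, ternary): the even n in [4, limit] that are not a sum of
    two primes, and the odd n in [7, limit] that are not a sum of three primes."""
    primes = primes_up_to(limit)
    # All integers <= limit that can be written as p + q with p, q prime.
    two_prime_sums = {p + q for p in primes for q in primes
                      if p <= q and p + q <= limit}
    binary = [n for n in range(4, limit + 1, 2) if n not in two_prime_sums]
    ternary = [n for n in range(7, limit + 1, 2)
               if not any((n - p) in two_prime_sums for p in primes if p < n)]
    return binary, ternary


if __name__ == "__main__":
    print(goldbach_exceptions(2000))
```

Both lists come back empty for every bound within reach of such a brute-force search; this is numerical evidence only and says nothing about large $n$.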

The first theoretical evidence in support of Goldbach's conjecture was obtained by Brun |2Tj . 
who showed that every large even integer is the sum of two integers having at most nine 
prime factors. Brun also obtained an upper bound of the correct order for the number of 
representations of a large even integer as the sum of two primes. 

During the early 1920s Hardy and Littlewood [H7J [Z2] developed the ideas in an earlier 
paper by Hardy and Ramanujan [73] into a new analytic method in additive number theory. 
Their method is known as the circle method. In 1923 Hardy and Littlewood |BTH ITT] applied 
the circle method to Goldbach's problem. Assuming the Generalized Riemann Hypothesis¹ 
(GRH), they proved that all but finitely many odd integers are sums of three primes and 
that all but $O(x^{1/2+\varepsilon})$ even integers $n \le x$ are sums of two primes. (Henceforth, $\varepsilon$ denotes 
a positive number which can be chosen arbitrarily small if the implied constant is allowed 
to depend on $\varepsilon$.) 

During the 1930s Schnirelmann [201] developed a probabilistic approach towards problems 
in additive number theory. Using his method and Brun's results, he was able to prove 
unconditionally that there exists a positive integer $s$ such that every sufficiently large integer 
is the sum of at most $s$ primes. Although the value of $s$ arising from this approach is much 
larger than the conjectured $s = 3$, Schnirelmann's result represented a significant achievement, 
as it defeated the popular belief at the time that the solution of Goldbach's problem 
must depend on GRH. (Since its first appearance, Schnirelmann's method has been polished 
significantly. In particular, the best result to date obtained in this fashion, by Ramaré [193], 
states that one can take $s = 7$.) 

In 1937 I. M. Vinogradov [236] found an ingenious new method for estimating sums over 
primes, which he applied to the exponential sum 

(1.2) $f(\alpha) = \sum_{p \le n} e(\alpha p)$,

where $\alpha$ is real, $p$ denotes a prime, and $e(\alpha) = \exp(2\pi i \alpha)$. Using his estimate for $f(\alpha)$, 
Vinogradov was able to give a new, unconditional proof of the result of Hardy and Littlewood 
on the ternary Goldbach problem. His result is known as Vinogradov's three prime theorem. 
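
To see why an exponential sum over primes is relevant here, recall that $\int_0^1 e(\alpha m)\,d\alpha$ equals 1 when $m = 0$ and 0 for every other integer $m$. Hence the number of ordered triples of primes with $p_1 + p_2 + p_3 = n$ equals $\int_0^1 f(\alpha)^3 e(-\alpha n)\,d\alpha$. The sketch below checks this identity numerically. It assumes that the sum in $f$ runs over the primes $p \le n$ and that representations are counted as ordered triples; the function names are hypothetical. Since $f(\alpha)^3$ is a trigonometric polynomial of degree at most $3n$, the integral can be replaced by an exact average over $N = 3n + 1$ equally spaced points.

```python
import cmath
from math import isqrt


def primes_up_to(limit):
    """Sieve of Eratosthenes: list of all primes p <= limit."""
    if limit < 2:
        return []
    is_prime = [True] * (limit + 1)
    is_prime[0] = is_prime[1] = False
    for i in range(2, isqrt(limit) + 1):
        if is_prime[i]:
            for j in range(i * i, limit + 1, i):
                is_prime[j] = False
    return [i for i, flag in enumerate(is_prime) if flag]


def e(x):
    """e(x) = exp(2*pi*i*x)."""
    return cmath.exp(2j * cmath.pi * x)


def f(alpha, primes):
    """Exponential sum over the given primes."""
    return sum(e(alpha * p) for p in primes)


def count_via_orthogonality(n):
    """Ordered representations n = p1 + p2 + p3, computed from f(alpha)**3."""
    primes = primes_up_to(n)
    big_n = 3 * n + 1  # enough sample points for the average to be exact
    total = sum(f(a / big_n, primes) ** 3 * e(-a * n / big_n)
                for a in range(big_n))
    return round((total / big_n).real)


def count_directly(n):
    """The same count by brute force over ordered pairs (p1, p2)."""
    primes = primes_up_to(n)
    prime_set = set(primes)
    return sum(1 for p1 in primes for p2 in primes
               if (n - p1 - p2) in prime_set)


if __name__ == "__main__":
    for n in (7, 21, 41):
        print(n, count_via_orthogonality(n), count_directly(n))
```

The two counts agree for every $n$; the difficulty of the ternary problem lies in estimating the integral for large $n$, not in the identity itself.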

Theorem 1 (Vinogradov, 1937). For a positive integer $n$, let $R(n)$ denote the number of 
representations of $n$ as the sum of three primes. Then 

(1.3) $R(n) = \dfrac{n^2}{2(\log n)^3}\,\mathfrak{S}(n) + O\bigl(n^2 (\log n)^{-4}\bigr)$,

where 

(1.4) $\mathfrak{S}(n) = \prod_{p \mid n} \Bigl(1 - \dfrac{1}{(p-1)^2}\Bigr) \prod_{p \nmid n} \Bigl(1 + \dfrac{1}{(p-1)^3}\Bigr)$.

In particular, every sufficiently large odd integer is the sum of three primes. 

1 An important conjecture about certain Dirichlet series; see §2.2 for details. 

The products in (1.4) are over the primes dividing $n$ and over those not dividing $n$, 
respectively. In particular, when $n$ is even, we have $\mathfrak{S}(n) = 0$, making (1.3) trivial. On the 
other hand, when $n$ is odd, we have $\mathfrak{S}(n) \ge 1$. We describe the proof of Theorem 1 in §3.1. 
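
The behaviour of the singular series is easy to observe numerically. The sketch below assumes the standard form $\mathfrak{S}(n) = \prod_{p \mid n} (1 - (p-1)^{-2}) \prod_{p \nmid n} (1 + (p-1)^{-3})$ and truncates the infinite product at a bound `prime_bound`; both the truncation and the function names are our own choices, and prime factors of $n$ above the bound are not accounted for.

```python
from math import isqrt


def primes_up_to(limit):
    """Sieve of Eratosthenes: list of all primes p <= limit."""
    if limit < 2:
        return []
    is_prime = [True] * (limit + 1)
    is_prime[0] = is_prime[1] = False
    for i in range(2, isqrt(limit) + 1):
        if is_prime[i]:
            for j in range(i * i, limit + 1, i):
                is_prime[j] = False
    return [i for i, flag in enumerate(is_prime) if flag]


def singular_series(n, prime_bound=10_000):
    """Truncated product for the ternary Goldbach singular series."""
    product = 1.0
    for p in primes_up_to(prime_bound):
        if n % p == 0:
            product *= 1 - 1 / (p - 1) ** 2   # factor for p dividing n
        else:
            product *= 1 + 1 / (p - 1) ** 3   # factor for p not dividing n
    return product


if __name__ == "__main__":
    for n in (10, 15, 105, 1001):
        print(n, singular_series(n))
```

For even $n$ the factor at $p = 2$ is $1 - 1/1 = 0$, so the product vanishes; for odd $n$ the factor at $p = 2$ equals 2, which is what keeps the product above 1.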

It should be noted that the independence of GRH in Theorem 1 comes at the price of 
a mind-boggling implied constant. If one avoids $O$-notation and makes all the constants 
explicit, one finds that the original (GRH-dependent) work of Hardy and Littlewood 
establishes the ternary Goldbach conjecture for $n \ge 10^{50}$, whereas Vinogradov's method requires 
$n \ge 10^{6\,800\,000}$ and even its most refined version available today (see Liu and Wang [163]) 
requires $n \ge 10^{1346}$. To put these numbers in perspective, we remark that even the bound 
$10^{50}$ is beyond hope of "checking the remaining cases by a computer". In fact, only recently 
have Deshouillers et al. [3T] proved that if GRH is true, the ternary Goldbach conjecture 
holds for all odd $n \ge 7$. 

In 1938, using Vinogradov's method, Chudakov [42], van der Corput [43], and Estermann 
each showed that almost all even integers $n \le x$ are sums of two primes. More 
precisely, they proved that for any $A > 0$ we have 

(1.5) $E(x) = O\bigl(x (\log x)^{-A}\bigr)$,

where $E(x)$ denotes the number of even integers $n \le x$ that cannot be represented as the 
sum of two primes. The first improvement on (1.5) was obtained by Vaughan [220]. It was 
followed by a celebrated work by Montgomery and Vaughan [173] from 1975, in which they 
established the existence of an absolute constant $\delta > 0$ such that 

(1.6) $E(x) = O\bigl(x^{1-\delta}\bigr)$.

The first to compute an explicit numerical value for $\delta$ were Chen and Pan [3^]. They showed 
that the method of Montgomery and Vaughan yields (1.6) with $\delta = 0.01$. Subsequently, 
this result has been sharpened by several authors and currently (1.6) is known to hold with 
$\delta = 0.086$ (see Li [136]). In June 2004, Pintz [186] announced a further improvement on 
(1.6). He has established the above bound with $\delta = \frac{1}{3}$ and can also show that for all but 
$O(x^{3/5+\varepsilon})$ even integers $n \le x$ either $n$ or $n - 2$ is the sum of two primes. 

One may also think of the binary Goldbach conjecture as a claim about the primes in 
the sequence 

(1.7) $A = A(n) = \{n - p : p \text{ prime number},\ 2 < p < n\}$,

namely, that such primes exist for all even $n \ge 6$. Denote by $P_r$ an integer having at most 
$r$ prime factors, counted with their multiplicities, and refer to such a number as an almost 
prime of order $r$ (thus, Brun's result mentioned above asserts that every large even $n$ can be 
represented in the form $n = P_9 + P_9'$). In 1947 Rényi [195] proved that there is a fixed integer 
$r$ such that the sequence $A$ contains a $P_r$-number when $n$ is sufficiently large. Subsequent 
work by many mathematicians reduced the value of $r$ in Rényi's result almost to the possible 
limit and fell just short of proving the binary Goldbach conjecture. The best result to date 
was obtained by Chen. 



Theorem 2 (Chen, 1973). For an even integer $n$, let $r(n)$ denote the number of representations 
of $n$ in the form $n = p + P_2$, where $p$ is a prime and $P_2$ is an almost prime of order 
2. There exists an absolute constant $n_0$ such that if $n \ge n_0$, then 

$r(n) > 0.67 \prod_{p > 2} \Bigl(1 - \dfrac{1}{(p-1)^2}\Bigr) \prod_{p > 2,\ p \mid n} \Bigl(\dfrac{p-1}{p-2}\Bigr) \dfrac{n}{(\log n)^2}$.

In particular, every sufficiently large even integer $n$ can be represented in the form $n = p + P_2$. 
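
The quantity counted in Theorem 2 can be tabulated directly for small $n$. The sketch below is a brute-force illustration with hypothetical helper names; it counts the primes $p \le n - 2$ for which $n - p$ has at most two prime factors, counted with multiplicity.

```python
from math import isqrt


def primes_up_to(limit):
    """Sieve of Eratosthenes: list of all primes p <= limit."""
    if limit < 2:
        return []
    is_prime = [True] * (limit + 1)
    is_prime[0] = is_prime[1] = False
    for i in range(2, isqrt(limit) + 1):
        if is_prime[i]:
            for j in range(i * i, limit + 1, i):
                is_prime[j] = False
    return [i for i, flag in enumerate(is_prime) if flag]


def big_omega(m):
    """Number of prime factors of m >= 1, counted with multiplicity."""
    count, d = 0, 2
    while d * d <= m:
        while m % d == 0:
            m //= d
            count += 1
        d += 1
    if m > 1:
        count += 1
    return count


def chen_representations(n):
    """All pairs (p, m) with p prime, m = n - p >= 2 and big_omega(m) <= 2."""
    return [(p, n - p) for p in primes_up_to(n - 2) if big_omega(n - p) <= 2]


if __name__ == "__main__":
    print(chen_representations(20))
```

For $n = 20$ this yields six representations, namely $3 + 17$, $5 + 15$, $7 + 13$, $11 + 9$, $13 + 7$ and $17 + 3$; the pair $(2, 18)$ is excluded because $18 = 2 \cdot 3 \cdot 3$ has three prime factors.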



1.2 Waring's problem 

Before proceeding with the Waring-Goldbach problem, we will make a detour to present the 
most important results in Waring's problem, as those results and the work on Goldbach's 
problem have been the main motivation behind the Waring-Goldbach problem. It was probably 
the ancient Greeks who first observed that every positive integer is the sum of four 
integer squares, but it was not until 1770 that a complete proof of this remarkable fact was 
given by Lagrange. Also in 1770, Waring proposed a generalization of the four square theorem 
that became known as Waring's problem and arguably led to the emergence of additive 
number theory. In modern terminology, Waring's conjecture states that for every integer 
$k \ge 2$ there exists an integer $s = s(k)$ such that every natural number $n$ is the sum of at 
most $s$ $k$th powers of natural numbers. Several special cases of this conjecture were settled 
during the 19th century, but the complete solution eluded mathematicians until 1909, when 
Hilbert [SHj proved the existence of such an $s$ for all $k$ by means of a difficult combinatorial 
argument. 

Let g(k) denote the least possible s as above. Hilbert's method produced a very poor 
bound for g(k). Using the circle method, Hardy and Littlewood were able to improve greatly 
on Hilbert's bound for g(k). In fact, through the efforts of many mathematicians, the circle 
method in conjunction with elementary and computational arguments has led to a nearly 
complete evaluation of $g(k)$. In particular, we know that $g(k)$ is determined by certain special 
integers $n < 4^k$ that can only be represented as sums of a large number of $k$th powers of 1, 
2 and 3 (see [228, §1.1] for further details on $g(k)$). 
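
The role of the small integers $n < 4^k$ can be seen in a short computation. The classical example is $n = 2^k \lfloor (3/2)^k \rfloor - 1$: since $n < 3^k$, only the powers $1^k$ and $2^k$ are available, and $n$ requires $2^k + \lfloor (3/2)^k \rfloor - 2$ of them. The sketch below (function names are ours) uses dynamic programming to find, for each $n < 4^k$, the least number of positive $k$th powers with sum $n$, and reports the hardest case.

```python
def min_kth_powers(limit, k):
    """table[n] = least number of positive k-th powers with sum n, 0 <= n <= limit."""
    powers = []
    x = 1
    while x ** k <= limit:
        powers.append(x ** k)
        x += 1
    table = [0] * (limit + 1)
    for n in range(1, limit + 1):
        # Remove one k-th power q and reuse the optimal answer for n - q.
        table[n] = 1 + min(table[n - q] for q in powers if q <= n)
    return table


def hardest_below_4_to_k(k):
    """Largest table value over 1 <= n < 4**k and the smallest n attaining it."""
    table = min_kth_powers(4 ** k - 1, k)
    worst = max(table)
    return worst, table.index(worst)


if __name__ == "__main__":
    for k in (2, 3, 4, 5):
        print(k, hardest_below_4_to_k(k))
```

For $k = 2, 3, 4, 5$ the hardest integers are $7, 23, 79, 223$, which need $4, 9, 19, 37$ powers respectively, in agreement with the formula above.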

A much more difficult question, and one that leads to a much deeper understanding of 
the additive properties of $k$th powers, is that of estimating the function $G(k)$, defined as the 
least $s$ such that every sufficiently large positive integer $n$ is the sum of $s$ $k$th powers. This 
function was introduced by Hardy and Littlewood [70], who obtained the bound 

(1.8) $G(k) \le (k-2)2^{k-1} + 5$.

In fact, they proved more than that. Let $I_{k,s}(n)$ denote the number of solutions of the 
diophantine equation 

(1.9) $x_1^k + x_2^k + \cdots + x_s^k = n$

in $x_1, \dots, x_s \in \mathbb{N}$. Hardy and Littlewood showed that if $s \ge (k-2)2^{k-1} + 5$, then 

(1.10) $I_{k,s}(n) \sim \dfrac{\Gamma\bigl(1 + \frac{1}{k}\bigr)^s}{\Gamma\bigl(\frac{s}{k}\bigr)}\, \mathfrak{S}_{k,s}(n)\, n^{s/k-1}$ as $n \to \infty$,

where $\Gamma$ stands for Euler's gamma-function and $\mathfrak{S}_{k,s}(n)$ is an absolutely convergent infinite 
series, called the singular series, such that 

$\mathfrak{S}_{k,s}(n) \ge c_1(k,s) > 0$. 

While the upper bound (1.8) represents a tremendous improvement over Hilbert's result, 
it is still considerably larger than the trivial lower bound $G(k) \ge k + 1$.² During the mid-1930s 
I. M. Vinogradov introduced several refinements of the circle method that allowed him to 
obtain a series of improvements on (1.8) for large $k$. In their most elaborate version, 
Vinogradov's methods yield a bound of the form³ 

$G(k) \le 2k(\log k + O(\log\log k))$.

First published by Vinogradov [240] in 1959, this bound withstood any significant improvement 
until 1992, when Wooley [245] proved that 

$G(k) \le k(\log k + \log\log k + O(1))$.

The latter is the sharpest bound to date for $G(k)$ when $k$ is large. For smaller $k$, one can 
obtain better results by using more specialized techniques (usually refinements of the circle 
method). The best known bounds for $G(k)$, $3 \le k \le 20$, are of the form $G(k) \le F(k)$, with 
$F(k)$ given by Table 1 below. 



k        3    4    5    6    7    8    9   10   11   12   13   14   15   16   17   18   19   20
F(k)     7   16   17   24   33   42   50   59   67   76   84   92  100  109  117  125  134  142

Table 1. Bounds for $G(k)$, $3 \le k \le 20$. 



2 Let $X$ be large. If $n \le X$, any solution of (1.9) must satisfy $1 \le x_1, \dots, x_s \le X^{1/k}$. There are at most $X^{s/k}$ 
such $s$-tuples, which yield at most $(1/s! + o(1))X^{s/k}$ distinct sums $x_1^k + \cdots + x_s^k$. Thus, when $s \le k$, there 
are not enough sums of $s$ $k$th powers to represent all the integers. 

3 In this and similar results appearing later, one can obtain explicit expressions in place of the $O$-terms, 
but those are too complicated to state here. 
With the exception of the bound $G(3) \le 7$, all of these results have been obtained by an 
iterative version of the circle method that originated in the work of Davenport |1UJ HB| 
and Davenport and Erdős [50]. The bound for $G(3)$ was established first by Linnik [141] 
and until recently lay beyond the reach of the circle method. The result on $G(4)$ is due to 
Davenport jUj, and in fact states that $G(4) = 16$. This is because 16 biquadrates are needed 
to represent integers of the form $n = 31 \cdot 16^r$, $r \in \mathbb{N}$. Other than Lagrange's four squares 
theorem, this is the only instance in which the exact value of $G(k)$ is known. However, 
Davenport jUj also proved that if $s \ge 14$, all sufficiently large integers $n \equiv r \pmod{16}$, 
$1 \le r \le s$, can be written as the sum of $s$ biquadrates; Kawada and Wooley [120] obtained 
a similar result for as few as 11 biquadrates. The remaining bounds in Table 1 appear in a 
series of recent papers by Vaughan and Wooley [229]-[232]. 

A great deal of effort has also been dedicated to estimating the function $\tilde{G}(k)$, which 
represents the least $s$ for which the asymptotic formula (1.10) holds. For large $k$, Ford [57] 
showed that 

(1.11) $\tilde{G}(k) \le k^2(\log k + \log\log k + O(1))$,

thus improving on earlier work by Vinogradov [238], Hua [101], and Wooley [246]. Furthermore, 
Vaughan [226, 227] and Boklan ^H] obtained the bounds 

$\tilde{G}(k) \le 2^k$ $(k \ge 3)$ and $\tilde{G}(k) \le \frac{7}{8} \cdot 2^k$ $(k \ge 6)$,

which supersede (1.11) when $k \le 8$. 

The work on Waring's problem has inspired research on several other questions concerned 
with the additive properties of $k$th powers (and of more general polynomial sequences). Such 
matters, however, are beyond the scope of this survey. The reader interested in a more 
comprehensive introduction to Waring's problem should refer to the monograph [228] or 
to a recent survey article by Vaughan and Wooley [233] (the latter also provides an excellent 
account of the history of Waring's problem). 

1.3 The Waring-Goldbach problem 

Vinogradov's proof of the three prime theorem provided a blueprint for subsequent applications 
of the Hardy-Littlewood circle method to additive problems involving primes. Shortly 
after the publication of Theorem 1, Vinogradov himself [237] and Hua [100] began studying 
Waring's problem with prime variables, known nowadays as the Waring-Goldbach problem. 
They were able to generalize the asymptotic formula (1.3) to $k$th powers for all $k \ge 1$ and 
ultimately their efforts led to the proof of Theorem 3 below. 

In order to describe the current knowledge about the Waring-Goldbach problem, we first 
need to introduce some notation. Let $k$ be a positive integer and $p$ a prime. We denote by 
$\theta = \theta(k,p)$ the (unique) integer such that $p^{\theta} \mid k$ and $p^{\theta+1} \nmid k$, and then define 

(1.12) $\gamma = \gamma(k,p) = \begin{cases} \theta + 2, & \text{if } p = 2 \text{ and } 2 \mid k, \\ \theta + 1, & \text{otherwise}, \end{cases} \qquad K(k) = \prod_{(p-1) \mid k} p^{\gamma}$.
In particular, we have $K(1) = 2$. It is not difficult to show that if an integer $n$ is the sum 
of $s$ $k$th powers of primes greater than $k + 1$, then $n$ must satisfy the congruence condition 
$n \equiv s \pmod{K(k)}$. Furthermore, define 

(1.13) $S(q,a) = \sum_{h=1,\ (h,q)=1}^{q} e\bigl(ah^k/q\bigr)$, $\quad \mathfrak{S}^*_{k,s}(n) = \sum_{q=1}^{\infty} \sum_{a=1,\ (a,q)=1}^{q} \phi(q)^{-s} S(q,a)^s e(-an/q)$,

where $(a,q)$ stands for the greatest common divisor of $a$ and $q$, and $\phi(q)$ is Euler's totient 
function, that is, the number of positive integers $n \le q$ which are relatively prime to $q$. The 
following result will be established in §3.3 and 



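Theorem 3. Let $k$, $s$ and $n$ be positive integers, and let $R^*_{k,s}(n)$ denote the number of 
solutions of the diophantine equation 

(1.14) $p_1^k + p_2^k + \cdots + p_s^k = n$

in primes $p_1, \dots, p_s$. Suppose that 

$s \ge \begin{cases} 2^k + 1, & \text{if } 1 \le k \le 5, \\ \frac{7}{8} \cdot 2^k + 1, & \text{if } 6 \le k \le 8, \\ k^2(\log k + \log\log k + O(1)), & \text{if } k > 8. \end{cases}$

Then 

(1.15) $R^*_{k,s}(n) \sim \mathfrak{S}^*_{k,s}(n)\, \dfrac{\Gamma\bigl(1 + \frac{1}{k}\bigr)^s}{\Gamma\bigl(\frac{s}{k}\bigr)}\, \dfrac{n^{s/k-1}}{(\log n)^s}$ as $n \to \infty$,

where $\mathfrak{S}^*_{k,s}(n)$ is defined by (1.13). Furthermore, the singular series $\mathfrak{S}^*_{k,s}(n)$ is absolutely 
convergent, and if $n \equiv s \pmod{K(k)}$, then $\mathfrak{S}^*_{k,s}(n) \ge c_2(k,s) > 0$. 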

In particular, we have the following corollaries to Theorem 3. 

Corollary 3.1. Every sufficiently large integer $n \equiv 5 \pmod{24}$ can be represented as the 
sum of five squares of primes. 

Corollary 3.2. Every sufficiently large odd integer can be represented as the sum of nine 
cubes of primes. 
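
The moduli 24 and 2 in these corollaries are the values $K(2)$ and $K(3)$. The sketch below computes $K(k)$ under the assumption that it is given by Hua's standard definition, $K(k) = \prod_{(p-1) \mid k} p^{\gamma}$ with $\gamma = \theta + 2$ if $p = 2$ and $2 \mid k$, and $\gamma = \theta + 1$ otherwise, where $p^{\theta}$ is the exact power of $p$ dividing $k$. It also checks the congruence $p^k \equiv 1 \pmod{K(k)}$ for primes $p > k + 1$, which is what forces $n \equiv s \pmod{K(k)}$. The function names are hypothetical.

```python
from math import isqrt


def primes_up_to(limit):
    """Sieve of Eratosthenes: list of all primes p <= limit."""
    if limit < 2:
        return []
    is_prime = [True] * (limit + 1)
    is_prime[0] = is_prime[1] = False
    for i in range(2, isqrt(limit) + 1):
        if is_prime[i]:
            for j in range(i * i, limit + 1, i):
                is_prime[j] = False
    return [i for i, flag in enumerate(is_prime) if flag]


def hua_modulus(k):
    """K(k): product of p**gamma over the primes p with (p - 1) dividing k."""
    result = 1
    for p in primes_up_to(k + 1):       # (p - 1) | k forces p <= k + 1
        if k % (p - 1) != 0:
            continue
        theta, m = 0, k
        while m % p == 0:               # p**theta exactly divides k
            m //= p
            theta += 1
        gamma = theta + 2 if (p == 2 and k % 2 == 0) else theta + 1
        result *= p ** gamma
    return result


def kth_powers_are_one(k, prime_bound=200):
    """Check p**k == 1 (mod K(k)) for all primes k + 1 < p <= prime_bound."""
    modulus = hua_modulus(k)
    return all(pow(p, k, modulus) == 1
               for p in primes_up_to(prime_bound) if p > k + 1)


if __name__ == "__main__":
    for k in range(1, 7):
        print(k, hua_modulus(k), kth_powers_are_one(k))
```

For example, the square of every prime $p > 3$ is congruent to 1 modulo 24, so a sum of five such squares is congruent to 5 modulo 24, as in Corollary 3.1.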

Hua introduced a function $H(k)$ similar to the function $G(k)$ in Waring's problem. $H(k)$ 
is defined as the least integer $s$ such that equation (1.14) has a solution in primes $p_1, \dots, p_s$ 
for all sufficiently large $n \equiv s \pmod{K(k)}$. It is conjectured that $H(k) = k + 1$ for all $k \ge 1$, 
but this conjecture has not been proved for any value of $k$ yet. When $k \le 3$, the sharpest 
known upper bounds for $H(k)$ are those given by Theorem 3, that is, 

$H(1) \le 3$, $H(2) \le 5$, $H(3) \le 9$. 

When $k \ge 4$, the best results in the literature are as follows. 
Theorem 4. Let $k \ge 4$ be an integer, and let $H(k)$ be as above. Then 

$H(k) \le \begin{cases} F(k), & \text{if } 4 \le k \le 10, \\ k(4\log k + 2\log\log k + O(1)), & \text{if } k > 10, \end{cases}$

where $F(k)$ is given by the following table. 

k        4    5    6    7    8    9   10
F(k)    14   21   33   46   63   83  107

Table 2. Bounds for $H(k)$, $4 \le k \le 10$. 



The cases $k = 6$ and $8 \le k \le 10$ of Theorem 4 are due to Thanigasalam [211], and the 
cases $k = 4$, 5 and 7 are recent results of Kawada and Wooley [121] and Kumchev [127], 
respectively. The bound for $k > 10$ is an old result of Hua, whose proof can be found in 
Hua's book [102]. To the best of our knowledge, this is the strongest published result for 
large $k$, although it is well-known to experts in the field that better results are within the 
reach of Wooley's refinement of Vinogradov's methods. In particular, by inserting Theorem 
1 in Wooley [247] into the machinery developed in Hua's monograph, one obtains 

$H(k) \le k\bigl(\frac{3}{2}\log k + O(\log\log k)\bigr)$ for $k \to \infty$. 



1.4 Other additive problems involving primes 

There are several variants and generalizations of the Waring-Goldbach problem that have 
attracted a lot of attention over the years. For example, one may consider the diophantine 
equation 

(1.16) $a_1 p_1^k + a_2 p_2^k + \cdots + a_s p_s^k = n$,

where $n, a_1, \dots, a_s$ are fixed, not necessarily positive, integers. There are several questions 
that we can ask about equations of this form. The main question, of course, is that of 
solubility. Furthermore, in cases where we do know that (1.16) is soluble, we may want to 
count the solutions with $p_1, \dots, p_s \le X$, where $X$ is a large parameter. A famous problem of 
this type is the twin-prime conjecture: there exist infinitely many primes $p$ such that $p + 2$ 
is also prime, that is, the equation 

$p_1 - p_2 = 2$

has infinitely many solutions. It is believed that this conjecture is of the same difficulty as 
the binary Goldbach problem, and in fact, the two problems share a lot of common history. 
In particular, while the twin-prime conjecture is still open, Chen's proof of Theorem 2 can 
be easily modified to establish that there exist infinitely many primes $p$ such that $p + 2 = P_2$. 

Other variants of the Waring-Goldbach problem consider more general diophantine equa- 
tions of the form 

$f(p_1) + f(p_2) + \cdots + f(p_s) = n$,

where $f(X) \in \mathbb{Z}[X]$, or systems of equations of the types (1.1) or (1.16). For example, 
Chapters 10 and 11 in Hua's monograph [102] deal with the system 

$p_1^j + p_2^j + \cdots + p_s^j = n_j$ $(1 \le j \le k)$. 

The number of solutions of this system satisfies an asymptotic formula similar to (1.15), but 
the main term in that asymptotic formula is less understood than the main term in (1.15) 
(see |3 El Una CZni for further details). 

Another classical problem in which a system of diophantine equations arises naturally 
concerns the existence of non-trivial arithmetic progressions consisting of $r$ primes. It has 
been conjectured that for every integer $r \ge 3$ there are infinitely many such arithmetic 
progressions. In other words, the linear system 

$p_i - 2p_{i+1} + p_{i+2} = 0$ $(1 \le i \le r - 2)$

has infinitely many solutions in distinct primes $p_1, \dots, p_r$. In the case $r = 3$ this can be 
established by a variant of Vinogradov's proof of the three primes theorem, but when $r > 3$ 
the above system lies beyond the reach of the circle method. In fact, until recently the most 
significant insights into progressions of more than three primes were the following two results: 

• Heath-Brown jHH| succeeded in proving that there exist infinitely many arithmetic 
progressions of three primes and a $P_2$-number. 

• Balog "fll] proved that for any $r$ there are $r$ distinct primes $p_1, \dots, p_r$ such that all the 
averages $\frac{1}{2}(p_i + p_j)$ are prime. 

Thus, the specialists in the field were stunned when Green and Tao [El] announced their 
amazing proof of the full conjecture. The reader will find a brief description of their ideas 
and of some related recent work in the last section. 
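
The linear system above simply says that consecutive differences $p_{i+1} - p_i$ are all equal. The following brute-force sketch (hypothetical function names) lists the progressions of $r$ primes below a bound as pairs (first term, common difference) and re-checks the linear system on one of them.

```python
from math import isqrt


def primes_up_to(limit):
    """Sieve of Eratosthenes: list of all primes p <= limit."""
    if limit < 2:
        return []
    is_prime = [True] * (limit + 1)
    is_prime[0] = is_prime[1] = False
    for i in range(2, isqrt(limit) + 1):
        if is_prime[i]:
            for j in range(i * i, limit + 1, i):
                is_prime[j] = False
    return [i for i, flag in enumerate(is_prime) if flag]


def prime_progressions(limit, r):
    """All (p, d) with d >= 1 such that p, p + d, ..., p + (r - 1) d are primes <= limit."""
    primes = primes_up_to(limit)
    prime_set = set(primes)
    found = []
    for p in primes:
        for d in range(1, (limit - p) // (r - 1) + 1):
            if all(p + i * d in prime_set for i in range(r)):
                found.append((p, d))
    return found


def satisfies_linear_system(terms):
    """Check p_i - 2 p_{i+1} + p_{i+2} = 0 for all consecutive triples."""
    return all(terms[i] - 2 * terms[i + 1] + terms[i + 2] == 0
               for i in range(len(terms) - 2))


if __name__ == "__main__":
    print(prime_progressions(100, 5))
```

Among the progressions of five primes below 100 one finds $5, 11, 17, 23, 29$, that is, the pair $(5, 6)$.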

Finally, instead of (1.1), one may study the inequality 

$|x_1 + \cdots + x_s - \alpha| < \varepsilon$,

where $\alpha$ is a real number, $\varepsilon$ is a small positive number and $x_1, \dots, x_s$ are real variables taking 
values from a given sequence (or sequences). For example, by setting $x_j = p_j^c$, where $c > 1$ 
is not an integer, we can generalize the Waring-Goldbach problem to fractional powers of 
primes. We will mention several results of this form in §5.7. 

2 The distribution of primes 

In this section we discuss briefly some classical results about primes, which play an important 
role in additive prime number theory. 
2.1 The Prime Number Theorem 

The first result on the distribution of primes is Euclid's theorem that there are infinitely 
many prime numbers. In 1798 Legendre conjectured that the prime counting function $\pi(x)$ 
(i.e., the number of primes $p \le x$) satisfies the asymptotic relation 

(2.1) $\lim_{x \to \infty} \dfrac{\pi(x)}{x/(\log x)} = 1$;

this is the classical statement of the Prime Number Theorem. Later Gauss observed that 
the logarithmic integral 

$\operatorname{li} x = \int_2^x \dfrac{dt}{\log t}$

seemed to provide a better approximation to $\pi(x)$ than the function $x/(\log x)$ appearing in 
(2.1), and this is indeed the case. Thus, in anticipation of versions of the Prime Number 
Theorem that are more precise than (2.1), we define the error term 

(2.2) $\Delta(x) = \pi(x) - \operatorname{li} x$. 
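
Gauss's observation is easy to reproduce. The sketch below (hypothetical function names) counts primes with a sieve and approximates $\operatorname{li} x = \int_2^x dt/\log t$ by the composite Simpson rule; the number of subintervals is an arbitrary choice that is more than sufficient at this scale.

```python
from math import isqrt, log


def prime_count(x):
    """pi(x): number of primes p <= x, via the sieve of Eratosthenes."""
    if x < 2:
        return 0
    is_prime = [True] * (x + 1)
    is_prime[0] = is_prime[1] = False
    for i in range(2, isqrt(x) + 1):
        if is_prime[i]:
            for j in range(i * i, x + 1, i):
                is_prime[j] = False
    return sum(is_prime)


def li(x, steps=100_000):
    """Composite Simpson approximation to the integral of 1/log(t) over [2, x]."""
    if steps % 2:
        steps += 1                      # Simpson's rule needs an even count
    h = (x - 2) / steps
    total = 1 / log(2) + 1 / log(x)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) / log(2 + i * h)
    return total * h / 3


def compare_approximations(x):
    """Return (pi(x), li x, x / log x)."""
    return prime_count(x), li(x), x / log(x)


if __name__ == "__main__":
    for x in (10**3, 10**4, 10**5):
        print(x, compare_approximations(x))
```

At $x = 10^5$ one has $\pi(x) = 9592$, and the error of $\operatorname{li} x$ is below 40 in absolute value, while $x/\log x$ is off by more than 900.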

The first step toward a proof of the Prime Number Theorem was made by Chebyshev. 
In the early 1850s he proved that (2.1) predicts correctly the order of $\pi(x)$, that is, he 
established the existence of absolute constants $c_2 > c_1 > 0$ such that 

$\dfrac{c_1 x}{\log x} < \pi(x) < \dfrac{c_2 x}{\log x}$. 

Chebyshev also showed that if the limit on the left side of (2.1) exists, then it must be equal 
to 1. 

In 1859 Riemann published his famous memoir [197], in which he demonstrated the 
intimate relation between $\pi(x)$ and the function which now bears his name, that is, the 
Riemann zeta-function defined by 

(2.3) $\zeta(s) = \sum_{n=1}^{\infty} n^{-s} = \prod_p \bigl(1 - p^{-s}\bigr)^{-1}$ $(\operatorname{Re}(s) > 1)$. 

This and similar series had been used earlier by Euler⁴ and Dirichlet, but only as functions of 
a real variable. Riemann observed that $\zeta(s)$ is holomorphic in the half-plane $\operatorname{Re}(s) > 1$ and 
that it can be continued analytically to a meromorphic function, whose only singularity is a 
simple pole at $s = 1$. It is not difficult to deduce from (2.3) that $\zeta(s) \ne 0$ in the half-plane 
$\operatorname{Re}(s) > 1$. Riemann observed that $\zeta(s)$ has infinitely many zeros in the strip $0 \le \operatorname{Re}(s) \le 1$ 
and proposed several conjectures concerning those zeros and the relation between them and 
the Prime Number Theorem. The most famous among those conjectures, and the only one 
that is still open, is known as the Riemann Hypothesis. 

4 In particular, Euler established the equality between $\zeta(s)$ and the infinite product in (2.3), which is 
known as the Euler product of $\zeta(s)$. 
Riemann Hypothesis (RH). All the zeros of $\zeta(s)$ with $0 < \operatorname{Re}(s) < 1$ lie on the line 
$\operatorname{Re}(s) = \frac{1}{2}$. 

The remaining conjectures in Riemann's paper were proved by the end of the 19th century. 
In particular, it was proved that the Prime Number Theorem follows from the nonvanishing 
of $\zeta(s)$ on the line $\operatorname{Re}(s) = 1$. Thus, when in 1896 Hadamard and de la Vallée Poussin 
proved (independently) that $\zeta(1 + it) \ne 0$ for all real $t$, the Prime Number Theorem was 
finally proved. In 1899 de la Vallée Poussin obtained the following quantitative result.⁵ 
(Henceforth, we often use Vinogradov's notation $A \ll B$, which means that $A = O(B)$.) 

Theorem 5 (de la Vallée Poussin, 1899). Let $\Delta(x)$ be defined by (2.2). There exists an 
absolute constant $c > 0$ such that 

$\Delta(x) \ll x \exp\bigl(-c\sqrt{\log x}\bigr)$. 

De la Vallée Poussin's theorem has been improved somewhat, but not nearly as much as 
one would hope. The best result to date is due to I. M. Vinogradov [239] and Korobov [123], 
who obtained (independently) the following estimate for $\Delta(x)$. 

Theorem 6 (Vinogradov, Korobov, 1958). Let $\Delta(x)$ be defined by (2.2). There exists 
an absolute constant $c > 0$ such that 

$\Delta(x) \ll x \exp\bigl(-c(\log x)^{3/5}(\log\log x)^{-1/5}\bigr)$. 

In comparison, if the Riemann Hypothesis is assumed, one has 

(2.4) $\Delta(x) \ll x^{1/2}\log x$,

which, apart from the power of the logarithm, is best possible. The reader can find further 
information about the Prime Number Theorem and the Riemann zeta-function in the 
standard texts on the subject (e.g., j4TH ITU31 ITT71 IP3T1 12T2| ). 



2.2 Primes in arithmetic progressions 

In a couple of memoirs published in 1837 and 1840, Dirichlet proved that if $a$ and $q$ are 
natural numbers with $(a,q) = 1$, then the arithmetic progression $a$ mod $q$ contains infinitely 
many primes. In fact, Dirichlet's argument can be refined so as to establish the asymptotic 
formula 

(2.5) $\sum_{p \le x,\ p \equiv a \pmod{q}} \dfrac{1}{p} \sim \dfrac{1}{\phi(q)} \log\log x$ as $x \to \infty$,

5 Functions of the type $f(x) = \exp\bigl((\log x)^{\lambda}\bigr)$, where $\lambda$ is a constant, are quite common in analytic number 
theory. To help the reader appreciate results such as Theorems 5 and 6, we remark that as $x \to \infty$ such a 
function with $0 < \lambda < 1$ grows more rapidly than any fixed power of $\log x$, but less rapidly than $x^{\varepsilon}$ for any 
fixed $\varepsilon > 0$. 
valid for all $a$ and $q$ with $(a,q) = 1$. Fix $q$ and consider the various arithmetic progressions 
$a$ mod $q$ (here $\phi(q)$ is Euler's totient function). Since all but finitely many primes lie 
in progressions with $(a,q) = 1$ and there are $\phi(q)$ such progressions, (2.5) suggests that 
each arithmetic progression $a$ mod $q$, with $(a,q) = 1$, "captures its fair share" of prime 
numbers, i.e., that the primes are uniformly distributed among the (appropriate) arithmetic 
progressions to a given modulus $q$. Thus, one may expect that if $(a,q) = 1$, then 

(2.6) $\pi(x;q,a) := \sum_{p \le x,\ p \equiv a \pmod{q}} 1 \sim \dfrac{\operatorname{li} x}{\phi(q)}$ as $x \to \infty$. 

This is the prime number theorem for arithmetic progressions. One may consider (2.6) from 
two different viewpoints. First, one may fix $a$ and $q$ and ask whether (2.6) holds (allowing the 
convergence to depend on $q$ and $a$). Posed in this form, the problem is a minor generalization 
of the Prime Number Theorem. In fact, shortly after proving Theorem 5, de la Vallée Poussin 
established that 

$\Delta(x;q,a) := \pi(x;q,a) - \dfrac{\operatorname{li} x}{\phi(q)} \ll x \exp\bigl(-c\sqrt{\log x}\bigr)$,

where $c = c(q,a) > 0$ and the implied constant depends on $q$ and $a$. The problem becomes 
much more difficult if one requires an estimate that is explicit in $q$ and uniform in $a$. The 
first result of this kind was obtained by Page [176], who proved the existence of a (small) 
positive number $\delta$ such that 

(2.7) $\Delta(x;q,a) \ll x \exp\bigl(-(\log x)^{\delta}\bigr)$,

whenever $1 \le q \le (\log x)^{2-\delta}$ and $(a,q) = 1$. In 1935 Siegel [208] (essentially) proved the 
following result known as the Siegel-Walfisz theorem. 

Theorem 7 (Siegel, 1935). For any fixed $A > 0$, there exists a constant $c = c(A) > 0$ 
such that 

$\pi(x;q,a) = \dfrac{\operatorname{li} x}{\phi(q)} + O\bigl(x \exp\bigl(-c\sqrt{\log x}\bigr)\bigr)$

whenever $q \le (\log x)^A$ and $(a,q) = 1$. 

Remark. While this result is clearly sharper than Page's, it does have one significant draw- 
back: it is ineffective, that is, given a particular value of A, the proof does not allow the 
constant c(A) or the O-implied constant to be computed. 
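
The uniform distribution of primes among the reduced residue classes is already visible for small $x$. The sketch below (hypothetical function names) tabulates $\pi(x;q,a)$ for every $a$ with $(a,q) = 1$, so that the counts can be compared with $\pi(x)/\phi(q)$.

```python
from math import gcd, isqrt


def primes_up_to(limit):
    """Sieve of Eratosthenes: list of all primes p <= limit."""
    if limit < 2:
        return []
    is_prime = [True] * (limit + 1)
    is_prime[0] = is_prime[1] = False
    for i in range(2, isqrt(limit) + 1):
        if is_prime[i]:
            for j in range(i * i, limit + 1, i):
                is_prime[j] = False
    return [i for i, flag in enumerate(is_prime) if flag]


def primes_by_residue(x, q):
    """Map each reduced residue a mod q to the number of primes p <= x with p = a (mod q)."""
    counts = {a: 0 for a in range(q) if gcd(a, q) == 1}
    for p in primes_up_to(x):
        a = p % q
        if a in counts:                 # primes dividing q fall outside every class
            counts[a] += 1
    return counts


if __name__ == "__main__":
    print(primes_by_residue(100, 4))
    print(primes_by_residue(10**5, 10))
```

For $q = 4$ and $x = 100$ the classes $1$ and $3$ contain 11 and 13 primes, respectively; the prime 2 lies in neither class.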

The above results have been proved using the analytic properties of a class of generalizations 
of the Riemann zeta-function known as Dirichlet $L$-functions. For each positive integer 
$q$ there are $\phi(q)$ functions $\chi : \mathbb{Z} \to \mathbb{C}$, called Dirichlet characters mod $q$, with the following 
properties: 

• $\chi$ is totally multiplicative: $\chi(mn) = \chi(m)\chi(n)$; 

• $\chi$ is $q$-periodic; 

• $|\chi(n)| = 1$ if $(n,q) = 1$ and $\chi(n) = 0$ if $(n,q) > 1$; 

• if $(n,q) = 1$, then 

$\sum_{\chi \bmod q} \chi(n) = \begin{cases} \phi(q), & \text{if } n \equiv 1 \pmod{q}, \\ 0, & \text{otherwise}. \end{cases}$

For more information about the construction and properties of the Dirichlet characters we 
refer the reader to gg QUI DHH EES] • 
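
For a prime modulus the characters can be written down explicitly, which makes the listed properties easy to verify. The sketch below covers only prime $q$ (an assumption made for brevity, since the general construction is more involved): if $g$ is a primitive root mod $q$ and $n \equiv g^{j}$, the characters are $\chi_m(n) = e(mj/(q-1))$ for $0 \le m < q - 1$. The function names are hypothetical.

```python
import cmath


def primitive_root(q):
    """Smallest primitive root modulo a prime q, found by brute force."""
    for g in range(1, q):
        seen, x = set(), 1
        for _ in range(q - 1):
            x = x * g % q
            seen.add(x)
        if len(seen) == q - 1:
            return g
    raise ValueError("no primitive root found; q must be prime")


def dirichlet_characters(q):
    """All q - 1 Dirichlet characters modulo a prime q, as Python functions."""
    g = primitive_root(q)
    index, x = {}, 1
    for j in range(q - 1):              # index[n] = j where n = g**j (mod q)
        index[x] = j
        x = x * g % q

    def make_character(m):
        def chi(n):
            n %= q
            if n == 0:                  # (n, q) > 1 exactly when q divides n
                return 0
            return cmath.exp(2j * cmath.pi * m * index[n] / (q - 1))
        return chi

    return [make_character(m) for m in range(q - 1)]


def character_sum(q, n):
    """Sum of chi(n) over all characters chi modulo the prime q."""
    return sum(chi(n) for chi in dirichlet_characters(q))


if __name__ == "__main__":
    for n in range(1, 7):
        print(n, abs(character_sum(7, n)))
```

The printed sums are $\phi(7) = 6$ for $n = 1$ and 0 (up to rounding error) for $n = 2, \dots, 6$, as the orthogonality relation predicts.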

Given a character $\chi$ mod $q$, we define the Dirichlet $L$-function 

$L(s,\chi) = \sum_{n=1}^{\infty} \chi(n) n^{-s} = \prod_p \bigl(1 - \chi(p)p^{-s}\bigr)^{-1}$ $(\operatorname{Re}(s) > 1)$. 

Similarly to $\zeta(s)$, $L(s,\chi)$ is holomorphic in the half-plane $\operatorname{Re}(s) > 1$ and can be continued 
analytically to a meromorphic function on $\mathbb{C}$ that has at most one pole, which (if present) 
must be a simple pole at $s = 1$. Furthermore, just as $\zeta(s)$, the continued $L(s,\chi)$ has infinitely 
many zeros in the strip $0 \le \operatorname{Re}(s) \le 1$, and the horizontal distribution of those zeros has 
important implications on the distribution of primes in arithmetic progressions. For example, 
the results of de la Vallée Poussin, Page and Siegel mentioned above were proved by showing 
that no $L$-function can have a zero "close" to the line $\operatorname{Re}(s) = 1$. We also have the following 
generalization of the Riemann Hypothesis. 

Generalized Riemann Hypothesis (GRH). Let $L(s,\chi)$ be a Dirichlet $L$-function. Then 
all the zeros of $L(s,\chi)$ with $0 < \operatorname{Re}(s) < 1$ lie on the line $\operatorname{Re}(s) = \frac{1}{2}$. 

Assuming GRH, we can deduce easily that if $(a,q) = 1$, then 

(2.8) $\pi(x;q,a) = \dfrac{\operatorname{li} x}{\phi(q)} + O\bigl(x^{1/2}\log x\bigr)$,

which is nontrivial when $1 \le q \le x^{1/2}(\log x)^{-2-\varepsilon}$. 

In many applications one only needs (2.8) to hold "on average" over the moduli $q$. During 
the 1950s and 1960s several authors obtained estimates for averages of $\Delta(x;q,a)$. In 
particular, the following quantity was studied extensively: 

$E(x,Q) = \sum_{q \le Q} \max_{(a,q)=1} \max_{y \le x} |\Delta(y;q,a)|$. 

The trivial bound for this quantity is $E(x,Q) \ll x\log x$. One usually focuses on finding the 
largest value of $Q$ for which one can improve on this trivial bound, even if the improvement 
is fairly modest. The sharpest result in this direction was established (independently) by 
Bombieri ^Hj and A. I. Vinogradov [234] in 1965. Their result is known as the 
Bombieri-Vinogradov theorem and (in the slightly stronger form given by Bombieri) can be stated as 
follows. 

Theorem 8 (Bombieri, Vinogradov, 1965). For any fixed $A > 0$, there exists a $B = 
B(A) > 0$ such that 

(2.9) $E(x,Q) \ll x(\log x)^{-A}$,

provided that $Q \le x^{1/2}(\log x)^{-B}$. 

We should note that other than the value of $B(A)$ the range for $Q$ in this result is as long 
as the range we can deduce from GRH. Indeed, GRH yields $B = A + 1$, whereas Bombieri 
obtained Theorem 8 with $B = 3A + 22$ and more recently Vaughan [223] gave $B = A + 5/2$. 

2.3 Primes in short intervals 

Throughout this section, we write $p_n$ for the $n$th prime number. We are interested in
estimates for the difference $p_{n+1} - p_n$ between two consecutive primes. Cramér was the
first to study this question systematically. He proved [?] that the Riemann Hypothesis
implies

$p_{n+1} - p_n \ll p_n^{1/2} \log p_n.$

Cramér also proposed a probabilistic model of the prime numbers that leads to very precise
(and very bold) predictions of the asymptotic properties of the primes. In particular, he
conjectured [?] that

(2.10) $\limsup_{n \to \infty} \dfrac{p_{n+1} - p_n}{(\log p_n)^2} = 1.$

A non-trivial upper bound for $p_{n+1} - p_n$ can be obtained as a consequence of the Prime
Number Theorem, but Hoheisel [96] found a much sharper result. He proved unconditionally
the asymptotic formula

(2.11) $\pi(x + h) - \pi(x) \sim h (\log x)^{-1}$ as $x \to \infty$,

with $h = x^{1 - (3300)^{-1}}$. There have been several improvements on Hoheisel's result and it is
now known that (2.11) holds with $h = x^{7/12}$ (see Heath-Brown [86]). Furthermore, several
mathematicians have shown that even shorter intervals must contain primes (without
establishing an asymptotic formula for the number of primes in such intervals). The best result in
this direction is due to Baker, Harman, and Pintz [?], who proved that for each $n$ one has

$p_{n+1} - p_n \ll p_n^{0.525}.$

A related problem seeks small gaps between consecutive primes. In particular, the
twin-prime conjecture can be stated in the form

$\liminf_{n \to \infty} (p_{n+1} - p_n) = 2.$

It is an exercise to show that the Prime Number Theorem implies the inequality

$\liminf_{n \to \infty} \dfrac{p_{n+1} - p_n}{\log p_n} \le 1.$

Improvements on this trivial bound, on the other hand, have proved notoriously difficult
and, so far, the best result, due to Maier [165], is

$\liminf_{n \to \infty} \dfrac{p_{n+1} - p_n}{\log p_n} \le 0.2486\ldots.$
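
The normalizations appearing in this section can be explored numerically. The following is only an
illustrative sketch (the function names and the cutoff $10^5$ are our own choices, not part of the
results quoted above): it lists the largest gap between consecutive primes below a limit and the
extreme values of $(p_{n+1} - p_n)/(\log p_n)^2$ and $(p_{n+1} - p_n)/\log p_n$ in that range.
Such finite data say nothing about the limits themselves.

```python
from math import log

def primes_up_to(limit):
    """Return all primes <= limit using the sieve of Eratosthenes."""
    sieve = bytearray([1]) * (limit + 1)
    sieve[0:2] = b"\x00\x00"
    for p in range(2, int(limit ** 0.5) + 1):
        if sieve[p]:
            # Cross out the multiples of p, starting from p*p.
            sieve[p * p :: p] = bytearray(len(range(p * p, limit + 1, p)))
    return [i for i, is_prime in enumerate(sieve) if is_prime]

def gap_statistics(limit):
    """Summarize the gaps p_{n+1} - p_n between consecutive primes <= limit."""
    ps = primes_up_to(limit)
    pairs = list(zip(ps, ps[1:]))
    largest_gap, start = max((q - p, p) for p, q in pairs)
    # Normalization of (2.10); small primes are skipped (our choice).
    cramer_ratio = max((q - p) / log(p) ** 2 for p, q in pairs if p >= 11)
    # Normalization used in the small-gap results.
    small_ratio = min((q - p) / log(p) for p, q in pairs if p >= 11)
    twin_pairs = sum(1 for p, q in pairs if q - p == 2)
    return largest_gap, start, cramer_ratio, small_ratio, twin_pairs

if __name__ == "__main__":
    print(gap_statistics(10 ** 5))
```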

2.4 Primes in sparse sequences 

We say that an infinite sequence of primes $\mathcal{S}$ is sparse if

$\pi(\mathcal{S}; x) := \#\{p \in \mathcal{S} : p \le x\} = o(\pi(x))$ as $x \to \infty$.

A classical example that has attracted a great deal of attention but has proved notoriously
difficult is that of primes represented by polynomials. To this day, there is not a single
example of a polynomial $f(X) \in \mathbb{Z}[X]$ of degree at least 2 which is known to take on infinitely
many prime values. The closest approximation is a result of Iwaniec [104], who showed that
if $a, b, c$ are integers such that $a > 0$, $(c, 2) = 1$, and the polynomial $f(X) = aX^2 + bX + c$ is
irreducible, then $f(X)$ takes on infinitely many $P_2$-numbers. On the other hand, in recent
years there has been some exciting progress in the direction of finding polynomials in two
variables that represent infinitely many primes. In 1998 Friedlander and Iwaniec [?] proved
that the polynomial $X^2 + Y^4$ represents infinitely many primes. We note that this polynomial
takes on $O(x^{3/4})$ values up to $x$. In 2001 Heath-Brown [?] obtained an analogous result for
the polynomial $X^3 + 2Y^3$ whose values are even sparser: it takes on $O(x^{2/3})$ values up to $x$.
Furthermore, Heath-Brown and Moroz [52] extended the latter result to general irreducible
binary cubic forms in $\mathbb{Z}[X, Y]$ (subject to some mild necessary conditions).

Another class of sparse sequences of prime numbers arises in the context of diophantine
approximation. The two best known examples of this kind are the sequences

(2.12) $\mathcal{S}_\lambda = \{p : p \text{ is prime with } \{\sqrt{p}\} < p^{-\lambda}\}$

and

(2.13) $\mathcal{P}_c = \{p : p = [n^c] \text{ for some integer } n\}.$

Here, $\lambda \in (0, 1)$ and $c > 1$ are fixed real numbers, $\{x\}$ denotes the fractional part of the real
number $x$, and $[x] = x - \{x\}$. The sequence $\mathcal{S}_\lambda$ was introduced by I. M. Vinogradov, who
proved (see [?, Chapter 4]) that if $0 < \lambda < 1/10$, then

$\pi(\mathcal{S}_\lambda; x) \sim \dfrac{x^{1-\lambda}}{(1 - \lambda) \log x}$ as $x \to \infty$.

The admissible range for $\lambda$ has been subsequently extended to $0 < \lambda < 1/4$ by Balog [?]
and Harman [?], while Harman and Lewis [?] showed that $\mathcal{S}_\lambda$ is infinite for $0 < \lambda < 0.262$.

The first to study the sequence (2.13) was Piatetski-Shapiro [185], who considered $\mathcal{P}_c$ as
a sequence of primes represented by a "polynomial of degree $c$". Piatetski-Shapiro proved
that $\mathcal{P}_c$ is infinite when $1 < c < 12/11$. The range for $c$ has been extended several times and
it is currently known (see Rivat and Wu [199]) that $\mathcal{P}_c$ is infinite when $1 < c < 243/205$.
Furthermore, it is known (see Rivat and Sargos [198]) that when $1 < c < 1.16117\ldots$, we
have

$\pi(\mathcal{P}_c; x) \sim \dfrac{x^{1/c}}{\log x}$ as $x \to \infty$.
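
As a rough numerical illustration of the last asymptotic formula, one can count the primes of the
form $[n^c]$ directly. The sketch below is ours (the names `piatetski_shapiro_count` and `predicted`
are hypothetical helpers), it uses floating-point arithmetic for $n^c$, and it relies on the fact
that the values $[n^c]$ are distinct when $c > 1$. At this modest height the agreement with
$x^{1/c}/\log x$ should only be expected up to a factor close to 1.

```python
from math import floor, log

def prime_flags(limit):
    """Bytearray whose entry i is 1 exactly when i is prime (0 <= i <= limit)."""
    sieve = bytearray([1]) * (limit + 1)
    sieve[0:2] = b"\x00\x00"
    for p in range(2, int(limit ** 0.5) + 1):
        if sieve[p]:
            sieve[p * p :: p] = bytearray(len(range(p * p, limit + 1, p)))
    return sieve

def piatetski_shapiro_count(c, x):
    """Count the primes p <= x of the form p = [n^c], using a floating-point floor."""
    is_prime = prime_flags(x)
    count, n = 0, 1
    while True:
        value = floor(n ** c)
        if value > x:
            break
        count += is_prime[value]
        n += 1
    return count

def predicted(c, x):
    """The main term x^(1/c) / log x."""
    return x ** (1.0 / c) / log(x)

if __name__ == "__main__":
    c, x = 1.1, 10 ** 6
    print(piatetski_shapiro_count(c, x), predicted(c, x))
```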

3 The Hardy-Littlewood circle method

Most of the results mentioned in the Introduction have been proved by means of the Hardy- 
Littlewood circle method. In this section, we describe the general philosophy of the circle 
method, using its applications to the Goldbach and Waring-Goldbach problems to illustrate 
the main points. 

3.1 Vinogradov's three prime theorem 
3.1.1 Preliminaries 

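Using the orthogonality relation

(3.1) $\int_0^1 e(\alpha m)\,d\alpha = \begin{cases} 1, & \text{if } m = 0, \\ 0, & \text{if } m \ne 0, \end{cases}$

we can express $R(n)$ as a Fourier integral. We have

(3.2) $R(n) = \int_0^1 \sum_{p_1, p_2, p_3 \le n} e\big(\alpha(p_1 + p_2 + p_3 - n)\big)\,d\alpha = \int_0^1 f(\alpha)^3 e(-\alpha n)\,d\alpha,$

where $f(\alpha)$ is the exponential sum (1.2).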
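
Identity (3.2) can be checked on small examples. In the sketch below (our own illustration, with
hypothetical names `R_direct` and `R_fourier`), we assume that $f(\alpha) = \sum_{p \le n} e(\alpha p)$
and that $R(n)$ counts ordered triples of primes. Since the integrand is a trigonometric polynomial
whose frequencies lie in $[6 - n, 2n]$, the integral over $[0, 1]$ equals the average over the $M$
points $j/M$ as soon as $M > 2n$; we take $M = 3n + 1$.

```python
import cmath

def e(t):
    """e(t) = exp(2*pi*i*t)."""
    return cmath.exp(2j * cmath.pi * t)

def primes_up_to(n):
    """Primes <= n by trial division (adequate for small n)."""
    return [p for p in range(2, n + 1)
            if all(p % d for d in range(2, int(p ** 0.5) + 1))]

def R_direct(n):
    """Number of ordered triples of primes (p1, p2, p3) with p1 + p2 + p3 = n."""
    ps = primes_up_to(n)
    pset = set(ps)
    return sum(1 for p1 in ps for p2 in ps if (n - p1 - p2) in pset)

def R_fourier(n):
    """The integral in (3.2), evaluated as an exact average over M = 3n + 1 points."""
    ps = primes_up_to(n)
    M = 3 * n + 1
    total = 0j
    for j in range(M):
        alpha = j / M
        f = sum(e(alpha * p) for p in ps)
        total += f ** 3 * e(-alpha * n)
    return total / M

if __name__ == "__main__":
    for n in (21, 35):
        print(n, R_direct(n), R_fourier(n))
```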

The circle method uses (3.2) to derive an asymptotic formula for $R(n)$ from estimates for
$f(\alpha)$. The analysis of the right side of (3.2) rests on the observation that the behavior of $f(\alpha)$
depends on the distance from $\alpha$ to the set of fractions with "small" denominators. When
$\alpha$ is "near" such a fraction, we expect $f(\alpha)$ to be "large" and to have certain asymptotic
behavior. Otherwise, we can argue that the numbers $e(\alpha p)$ are uniformly distributed on the
unit circle and hence $f(\alpha)$ is "small". In order to make these observations rigorous, we need
to introduce some notation. Let $B$ be a positive constant to be chosen later and set

(3.3) $P = (\log n)^B.$

If $a$ and $q$ are integers with $1 \le a \le q \le P$ and $(a, q) = 1$, we define the major arc⁶

(3.4) $\mathfrak{M}(q, a) = \left[\dfrac{a}{q} - \dfrac{P}{qn},\ \dfrac{a}{q} + \dfrac{P}{qn}\right].$


The integration in (3.2) can be taken over any interval of length one and, in particular, over
$[Pn^{-1}, 1 + Pn^{-1}]$. We partition this interval into two subsets:

(3.5) $\mathfrak{M} = \bigcup_{q \le P} \bigcup_{\substack{1 \le a \le q \\ (a,q)=1}} \mathfrak{M}(q, a)$ and $\mathfrak{m} = [Pn^{-1}, 1 + Pn^{-1}] \setminus \mathfrak{M},$

called respectively the set of major arcs and the set of minor arcs. Then from (3.2) and
(3.5) it follows that

(3.6) $R(n) = R(n, \mathfrak{M}) + R(n, \mathfrak{m}),$

where we have denoted

$R(n, \mathfrak{B}) = \int_{\mathfrak{B}} f(\alpha)^3 e(-\alpha n)\,d\alpha.$



In the next section we explain how, using Theorem 7 and standard results from elementary
number theory, one can obtain an asymptotic formula for $R(n, \mathfrak{M})$ (see (3.13) below). Then
in §3.1.3 and §3.1.4 we discuss how one can show that $R(n, \mathfrak{m})$ is of a smaller order of
magnitude than the main term in that asymptotic formula (see (3.14)).



3.1.2 The major arcs 

In this section we sketch the estimation of the contribution from the major arcs. The
interested reader will find the missing details in [116, Chapter 10] or [228, Chapter 2].

It is easy to see that the major arcs $\mathfrak{M}(q, a)$ are mutually disjoint. Thus, using (3.4) and
(3.5), we can write

(3.7) $R(n, \mathfrak{M}) = \sum_{q \le P} \sum_{\substack{1 \le a \le q \\ (a,q)=1}} \int_{-P/(qn)}^{P/(qn)} f(a/q + \beta)^3\, e\big(-(a/q + \beta)n\big)\,d\beta.$

6 This term may seem a little peculiar at first, given that $\mathfrak{M}(q, a)$ is in fact an interval. The explanation
is that, in the original version of the circle method, Hardy and Littlewood used power series and Cauchy's
integral formula instead of exponential sums and (3.1) (see [228, §1.2]). In that setting, the role of $\mathfrak{M}(q, a)$
is played by a small circular arc near the root of unity $e(a/q)$.

We now proceed to approximate $f(a/q + \beta)$ by a simpler expression. To motivate our
choice of the approximation, we first consider the case $\beta = 0$. We split the sum $f(a/q)$ into
subsums according to the residue of $p$ modulo $q$ and take into account the definition (2.6).
We get

$f\left(\dfrac{a}{q}\right) = \sum_{h=1}^{q} \sum_{\substack{p \le n \\ p \equiv h \ (\mathrm{mod}\ q)}} e\left(\dfrac{ap}{q}\right) = \sum_{h=1}^{q} e\left(\dfrac{ah}{q}\right) \pi(n; q, h).$

The contribution of the terms with $(h, q) > 1$ is negligible (at most $q$). If $(h, q) = 1$, our
choice (3.3) of the parameter $P$ ensures that we can appeal to Theorem 7 to approximate
$\pi(n; q, h)$ by $\phi(q)^{-1} \operatorname{li} n$. We deduce that

(3.8) $f\left(\dfrac{a}{q}\right) = \dfrac{\operatorname{li} n}{\phi(q)} \sum_{\substack{1 \le h \le q \\ (h,q)=1}} e\left(\dfrac{ah}{q}\right) + O\left(qnP^{-4}\right).$

The exponential sum on the right side of (3.8) is known as the Ramanujan sum and is usually
denoted by $c_q(a)$. Its value is known for every pair of integers $a$ and $q$ (see [74, Theorem
271]). In particular, when $(a, q) = 1$ we have $c_q(a) = \mu(q)$, where $\mu$ is the Möbius function

(3.9) $\mu(n) = \begin{cases} 1, & \text{if } n = 1, \\ (-1)^k, & \text{if } n = p_1 \cdots p_k \text{ is the product of } k \text{ distinct primes}, \\ 0, & \text{otherwise.} \end{cases}$
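
The evaluation $c_q(a) = \mu(q)$ for $(a, q) = 1$ is easy to test by direct summation. The following
sketch is an illustration only; the helper names `mobius` and `ramanujan_sum` are ours, and the sum
is computed in floating point and then rounded, which is safe for the small moduli used here.

```python
import cmath
from math import gcd

def mobius(n):
    """Mobius function, following definition (3.9)."""
    result, p = 1, 2
    while p * p <= n:
        if n % p == 0:
            n //= p
            if n % p == 0:
                return 0  # n is divisible by p^2
            result = -result
        p += 1
    if n > 1:  # one remaining prime factor
        result = -result
    return result

def ramanujan_sum(q, a):
    """c_q(a): the sum of e(a*h/q) over 1 <= h <= q with (h, q) = 1."""
    total = sum(cmath.exp(2j * cmath.pi * a * h / q)
                for h in range(1, q + 1) if gcd(h, q) == 1)
    return round(total.real)  # Ramanujan sums are integers

if __name__ == "__main__":
    print([(q, ramanujan_sum(q, 1), mobius(q)) for q in range(1, 13)])
```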

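The situation does not change much if instead of $\alpha = a/q$ we consider $\alpha = a/q + \beta \in \mathfrak{M}(q, a)$.
In this case we find that

(3.10) $f\left(\dfrac{a}{q} + \beta\right) = \dfrac{\mu(q)}{\phi(q)}\, v(\beta) + O\left(nP^{-3}\right),$

where

$v(\beta) = \int_2^n \dfrac{e(\beta u)}{\log u}\,du.$

Raising (3.10) to the third power and inserting the result into the right side of (3.7), we
obtain

(3.11) $R(n, \mathfrak{M}) = \sum_{q \le P} \dfrac{\mu(q) c_q(-n)}{\phi(q)^3} \int_{-P/(qn)}^{P/(qn)} v(\beta)^3 e(-\beta n)\,d\beta + O\left(n^2 P^{-1}\right).$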

At this point, we extend the integration over $\beta$ to the whole real line, and then the summation
over $q$ to all positive integers. The arising error terms can be controlled easily by means of
well-known bounds for the functions $v(\beta)$ and $\phi(q)$, and we find that

(3.12) $R(n, \mathfrak{M}) = \mathfrak{S}(n) J(n) + O\left(n^2 P^{-1}\right),$

where $\mathfrak{S}(n)$ and $J(n)$ are the singular series and the singular integral defined by

$\mathfrak{S}(n) = \sum_{q=1}^{\infty} \dfrac{\mu(q) c_q(-n)}{\phi(q)^3}, \qquad J(n) = \int_{-\infty}^{\infty} v(\beta)^3 e(-\beta n)\,d\beta.$

The series $\mathfrak{S}(n)$ actually satisfies (1.4). Indeed, the function

$g(q) = \mu(q) c_q(-n) \phi(q)^{-3}$

is multiplicative in $q$, that is, $g(q_1 q_2) = g(q_1) g(q_2)$ whenever $(q_1, q_2) = 1$. Hence, using
the absolute convergence of $\mathfrak{S}(n)$ and the elementary properties of the arithmetic functions
involved in the definition of $g(q)$, we can represent the singular series as an Euler product:

$\mathfrak{S}(n) = \sum_{q=1}^{\infty} g(q) = \prod_p \left(1 + g(p) + g(p^2) + \cdots\right) = \prod_{p \mid n} \left(1 - \dfrac{1}{(p-1)^2}\right) \prod_{p \nmid n} \left(1 + \dfrac{1}{(p-1)^3}\right).$
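
The passage from the series to the Euler product can be sanity-checked numerically. The sketch
below is our own illustration under the definitions above (all function names are hypothetical):
it compares a partial sum of $\sum_q g(q)$ with a truncated Euler product. For even $n$ the factor
at $p = 2$ vanishes, in line with the fact that an even number is not a sum of three odd primes.

```python
import cmath
from math import gcd

def mobius(n):
    """Mobius function, as in (3.9)."""
    result, p = 1, 2
    while p * p <= n:
        if n % p == 0:
            n //= p
            if n % p == 0:
                return 0
            result = -result
        p += 1
    return -result if n > 1 else result

def phi(q):
    """Euler's function, by direct counting (fine for small q)."""
    return sum(1 for h in range(1, q + 1) if gcd(h, q) == 1)

def ramanujan_sum(q, a):
    """c_q(a), computed by direct summation and rounded to an integer."""
    total = sum(cmath.exp(2j * cmath.pi * a * h / q)
                for h in range(1, q + 1) if gcd(h, q) == 1)
    return round(total.real)

def singular_series_sum(n, Q):
    """Partial sum of g(q) = mu(q) c_q(-n) / phi(q)^3 over q <= Q."""
    total = 0.0
    for q in range(1, Q + 1):
        mu = mobius(q)
        if mu != 0:
            total += mu * ramanujan_sum(q, -n) / phi(q) ** 3
    return total

def singular_series_product(n, prime_limit):
    """Euler product over primes p <= prime_limit."""
    product = 1.0
    for p in range(2, prime_limit + 1):
        if all(p % d for d in range(2, int(p ** 0.5) + 1)):
            if n % p == 0:
                product *= 1 - 1 / (p - 1) ** 2
            else:
                product *= 1 + 1 / (p - 1) ** 3
    return product

if __name__ == "__main__":
    print(singular_series_sum(15, 300), singular_series_product(15, 10 ** 4))
```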

Also, an application of Fourier's inversion formula and some calculus reveal that

$J(n) = \dfrac{n^2}{2(\log n)^3} + O\left(n^2 (\log n)^{-4}\right).$

Therefore, if $B \ge 4$ we can conclude that

(3.13) $R(n, \mathfrak{M}) = \dfrac{n^2}{2(\log n)^3}\, \mathfrak{S}(n) + O\left(n^2 (\log n)^{-4}\right).$

3.1.3 The minor arcs 

In view of (3.6) and (3.13), it suffices to prove that (for some $B \ge 4$)

(3.14) $R(n, \mathfrak{m}) \ll n^2 (\log n)^{-4}.$

We have

(3.15) $|R(n, \mathfrak{m})| \le \int_{\mathfrak{m}} |f(\alpha)|^3\,d\alpha \le \left(\sup_{\mathfrak{m}} |f(\alpha)|\right) \int_0^1 |f(\alpha)|^2\,d\alpha.$

By Parseval's identity and the Prime Number Theorem,

(3.16) $\int_0^1 |f(\alpha)|^2\,d\alpha = \sum_{p \le n} 1 \ll n (\log n)^{-1}.$

Thus, (3.14) will follow from (3.15), if we show that

(3.17) $\sup_{\mathfrak{m}} |f(\alpha)| \ll n (\log n)^{-3}.$

We note that the trivial estimate for $f(\alpha)$ is

$f(\alpha) \ll \sum_{p \le n} 1 \ll n (\log n)^{-1},$

so in order to establish (3.17), we have to save a power of $\log n$ over this trivial estimate
(uniformly with respect to $\alpha \in \mathfrak{m}$). We can do this using the following lemma, which provides
such a saving under the assumption that $\alpha$ can be approximated by a reduced fraction whose
denominator $q$ is "neither too small, nor too large."

Lemma 3.1. Let $\alpha$ be real and let $a$ and $q$ be integers satisfying

$1 \le q \le n$, $(a, q) = 1$, $|q\alpha - a| \le q^{-1}$.

Then

$f(\alpha) \ll (\log n)^3 \left(n q^{-1/2} + n^{4/5} + n^{1/2} q^{1/2}\right).$

This is the sharpest known version of the estimate for $f(\alpha)$ established by I. M. Vinogradov
[236] in 1937. As we mentioned in the Introduction, that result was the main innovation
in Vinogradov's proof of Theorem 1. The above version is due to Vaughan [225].

We shall explain the proof of Lemma 3.1 in the next section and now we shall use it to
establish (3.17). To this end we need also the following lemma, known as Dirichlet's theorem
on diophantine approximation; its proof is elementary and can be found in [228, Lemma 2.1].

Lemma 3.2 (Dirichlet). Let $\alpha$ and $Q$ be real and $Q \ge 1$. There exist integers $a$ and $q$ such
that

$1 \le q \le Q$, $(a, q) = 1$, $|q\alpha - a| < Q^{-1}$.
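
For a concrete feel for Lemma 3.2, one can search for the approximating fraction by brute force.
This is only a sketch with a hypothetical function name; it is not the pigeonhole argument used in
the proof, and it works with floating-point values of $\alpha$. It picks the $q \le Q$ minimizing
the distance from $q\alpha$ to the nearest integer and then reduces the fraction, which can only
decrease $|q\alpha - a|$.

```python
from math import gcd, sqrt

def dirichlet_approximation(alpha, Q):
    """Return coprime integers (a, q) with 1 <= q <= Q minimizing |q*alpha - a|.

    By Lemma 3.2 the minimum is smaller than 1/Q.
    """
    best = None
    for q in range(1, int(Q) + 1):
        a = round(q * alpha)
        err = abs(q * alpha - a)
        if best is None or err < best[0]:
            best = (err, a, q)
    _, a, q = best
    d = gcd(a, q)  # dividing out d keeps the inequality |q*alpha - a| < 1/Q
    return a // d, q // d

if __name__ == "__main__":
    print(dirichlet_approximation(sqrt(2), 100))
```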

Let $\alpha \in \mathfrak{m}$. By (3.5) and Lemma 3.2 with $Q = nP^{-1}$, there are integers $a$ and $q$ such
that

$P < q \le nP^{-1}$, $(a, q) = 1$, $|q\alpha - a| < Pn^{-1} \le q^{-1}$.

Hence, an appeal to (3.3) and Lemma 3.1 gives

(3.18) $f(\alpha) \ll (\log n)^3 \left(nP^{-1/2} + n^{4/5}\right) \ll n (\log n)^{3 - B/2},$

and (3.17) follows on choosing $B \ge 12$. This completes the proof of Theorem 1.

The above proof of Vinogradov's theorem employs the Siegel-Walfisz theorem and, therefore,
is ineffective (recall the remark following the statement of Theorem 7). The interested
reader can find an effective proof (with a slightly weaker error term) in [116, Chapter 10].

3.1.4 The estimation of f(a) 

The main tools in the proof of Lemma 3.1 are estimates for bilinear sums of the form

(3.19) $S = \sum_{X < x \le 2X} \sum_{\substack{Y < y \le 2Y \\ xy \le n}} \xi_x \eta_y\, e(\alpha xy).$

We need to control two kinds of such sums, known as type I sums and type II sums. For
simplicity, we describe these two types of sums in the simplest cases, noting that the more
general sums arising in the actual proof of Lemma 3.1 can be reduced to these special cases
using standard trickery:

• type I sums: sums (3.19) with $|\xi_x| \le 1$, $\eta_y = 1$ for all $y$, and $X$ is "not too large";

• type II sums: sums (3.19) with $|\xi_x| \le 1$, $|\eta_y| \le 1$, and $X$, $Y$ are "neither large, nor
small".

Vinogradov reduced the estimation of $f(\alpha)$ to the estimation of type I and type II sums
by means of an intricate combinatorial argument. Nowadays we can achieve the same result
almost instantaneously by referring to the combinatorial identities of Vaughan [223, 225]
or Heath-Brown [?]. Let $\Lambda(k)$ denote von Mangoldt's function, whose value is $\log p$ or $0$
according as $k$ is a power of a prime $p$ or not. Vaughan's identity states that if $U$ and $V$ are
real parameters exceeding 1, then

(3.20) $\Lambda(k) = \sum_{\substack{dm = k \\ 1 \le d \le V}} \mu(d) \log m \ - \sum_{\substack{dlm = k \\ 1 \le d \le V \\ 1 \le m \le U}} \mu(d) \Lambda(m) \ - \sum_{\substack{dlm = k \\ 1 \le d \le V \\ m > U,\ dl > V}} \mu(d) \Lambda(m).$

Heath-Brown's identity states that if $k \le x$ and $J$ is a positive integer, then

$\Lambda(k) = \sum_{j=1}^{J} \binom{J}{j} (-1)^{j-1} \sum_{\substack{m_1 \cdots m_{2j} = k \\ m_1, \dots, m_j \le x^{1/J}}} \mu(m_1) \cdots \mu(m_j) \log m_{2j},$

where $\mu(m)$ is the Möbius function.
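
Vaughan's identity (3.20) is a finite combinatorial statement and can be verified by machine for
small $k$. The sketch below is our own illustration (hypothetical function names, parameters
$U = V = 5$ chosen arbitrarily). It evaluates the three sums on the right side of (3.20) by
enumerating factorizations $k = dlm$. The comparison with $\Lambda(k)$ is made for $k > U$; for
$k \le U$ the three sums cancel each other, which the check also records.

```python
from math import log, isclose

def mobius(n):
    """Mobius function."""
    result, p = 1, 2
    while p * p <= n:
        if n % p == 0:
            n //= p
            if n % p == 0:
                return 0
            result = -result
        p += 1
    return -result if n > 1 else result

def von_mangoldt(n):
    """Lambda(n) = log p if n is a power of the prime p, and 0 otherwise."""
    if n < 2:
        return 0.0
    p = 2
    while p * p <= n:
        if n % p == 0:
            while n % p == 0:
                n //= p
            return log(p) if n == 1 else 0.0
        p += 1
    return log(n)  # n itself is prime

def divisors(n):
    return [d for d in range(1, n + 1) if n % d == 0]

def vaughan_rhs(k, U, V):
    """Right side of (3.20): the first sum minus the second and third sums."""
    s1 = s2 = s3 = 0.0
    for d in divisors(k):
        mu = mobius(d)
        if d > V or mu == 0:
            continue
        s1 += mu * log(k // d)          # term with dm = k
        for m in divisors(k // d):      # factorizations dlm = k
            l = k // (d * m)
            if m <= U:
                s2 += mu * von_mangoldt(m)
            elif d * l > V:
                s3 += mu * von_mangoldt(m)
    return s1 - s2 - s3

if __name__ == "__main__":
    U = V = 5
    print(all(isclose(vaughan_rhs(k, U, V), von_mangoldt(k), abs_tol=1e-9)
              for k in range(6, 301)))
```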

Both identities can be used to reduce $f(\alpha)$ to type I and type II sums with equal success.
Here, we apply Vaughan's identity with $U = V = n^{2/5}$. We obtain

(3.21) $\sum_{k \le n} \Lambda(k) e(\alpha k) = W_1 - W_2 - W_3,$

with

$W_j = \sum_{k \le n} a_j(k) e(\alpha k)$ $(1 \le j \le 3)$,

where $a_j(k)$ denotes the $j$th sum on the right side of (3.20). The estimation of the sum on
the left side of (3.21) is essentially equivalent to that of $f(\alpha)$. The sums $W_1$ and $W_2$ on the
right side of (3.21) can be reduced to type I sums with $X \ll n^{4/5}$; $W_3$ can be reduced to
type II sums with $n^{2/5} \ll X, Y \ll n^{3/5}$. The reader can find all the details in the proof of
[228, Theorem 3.1]. Here we will be content with a brief description of the estimation of the
type I and type II sums.

Consider a type I sum $S_1$. We have

(3.22) $|S_1| \le \sum_{X < x \le 2X} \left| \sum_{Y < y \le Y'} e(\alpha xy) \right|,$

where $Y' = \min(2Y, n/x)$. We can estimate the inner sum in (3.22) by means of the
elementary bound

(3.23) $\left| \sum_{a < y \le b} e(\alpha y) \right| \le \min\left(b - a + 1, \|\alpha\|^{-1}\right),$

where $\|\alpha\|$ denotes the distance from $\alpha$ to the nearest integer. This inequality follows on
noting that the sum on the left is the sum of a geometric progression. We obtain

(3.24) $|S_1| \le \sum_{x \le 2X} \min\left(Y, \|\alpha x\|^{-1}\right) = T(\alpha)$, say.

Obviously, the trivial estimate for $T(\alpha)$ is

$T(\alpha) \ll XY.$

However, under the hypotheses of Lemma 3.1 one can establish by elementary methods that
(see [228, Lemma 2.2])

(3.25) $T(\alpha) \ll XY \left(\dfrac{1}{q} + \dfrac{1}{Y} + \dfrac{q}{XY}\right) \log(2XYq).$

Inserting this bound into the right side of (3.24), we obtain a satisfactory bound for $S_1$.
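
The two elementary ingredients of the type I estimate, the geometric-sum bound (3.23) and the sum
$T(\alpha)$ from (3.24), are easy to experiment with. The following sketch is illustrative only
(the function names are ours). It shows, for instance, that $T(\alpha)$ is far below the trivial
bound when $\alpha = \sqrt{2}$, which has no good approximations with small denominators, and is
large when $\alpha = 1/2$.

```python
import cmath
from math import inf, sqrt

def dist_to_nearest_integer(t):
    """||t||: the distance from t to the nearest integer."""
    return abs(t - round(t))

def geometric_sum_abs(alpha, a, b):
    """|sum of e(alpha*y) over integers a < y <= b|."""
    return abs(sum(cmath.exp(2j * cmath.pi * alpha * y) for y in range(a + 1, b + 1)))

def bound_323(alpha, a, b):
    """Right side of (3.23): min(b - a + 1, 1/||alpha||)."""
    norm = dist_to_nearest_integer(alpha)
    return min(b - a + 1, 1 / norm if norm > 0 else inf)

def T(alpha, X, Y):
    """T(alpha) from (3.24): the sum over x <= 2X of min(Y, 1/||alpha*x||)."""
    total = 0.0
    for x in range(1, 2 * X + 1):
        norm = dist_to_nearest_integer(alpha * x)
        total += min(Y, 1 / norm if norm > 0 else inf)
    return total

if __name__ == "__main__":
    X = Y = 100
    print(T(sqrt(2), X, Y), T(0.5, X, Y), 2 * X * Y)
```
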
To estimate a type II sum $S_2$, we first apply Cauchy's inequality and get

$|S_2|^2 \le Y \sum_{Y < y \le 2Y} \left| \sum_{X < x \le X'} \xi_x e(\alpha xy) \right|^2,$

where $X' = \min(2X, n/y)$. Squaring out and interchanging the order of summation, we
deduce

$\begin{aligned} |S_2|^2 &\le Y \sum_{Y < y \le 2Y} \sum_{X < x_1, x_2 \le X'} \xi_{x_1} \overline{\xi_{x_2}}\, e(\alpha (x_1 - x_2) y) \\ &\le Y \sum_{X < x_1, x_2 \le 2X} \left| \sum_{Y < y \le Y'} e(\alpha (x_1 - x_2) y) \right| \\ &\ll Y \sum_{X < x \le 2X} \sum_{|h| \le X} \left| \sum_{Y < y \le Y'} e(\alpha h y) \right|, \end{aligned}$

where $Y < Y' \le 2Y$. We remark that the innermost sum is now free of "unknown" weights
and can be estimated by means of (3.23). We get

(3.26) $|S_2|^2 \ll XY^2 + XY\,T(\alpha),$

and (3.25) again leads to a satisfactory bound for $S_2$.

3.2 The exceptional set in Goldbach's problem 

We now sketch the proof of (1.5). We will not discuss the proof of the more sophisticated
results of Montgomery and Vaughan [173] and Pintz [186], since they require knowledge of
the properties of Dirichlet L-functions far beyond the scope of this survey. The reader can
find excellent expositions of the Montgomery-Vaughan result in their original paper and also
in the monograph [177].

For an even integer $n$, let $r(n)$ denote the number of representations of $n$ as the sum of
two primes, let $\mathcal{Z}(N)$ denote the set of even integers $n \in (N, 2N]$ with $r(n) = 0$, and write
$Z(N) = |\mathcal{Z}(N)|$. Since

$E(x) = \sum_{j=1}^{\infty} Z(x 2^{-j}),$

it suffices to bound $Z(N)$ for large $N$.

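Define $f(\alpha)$, $\mathfrak{M}$, and $\mathfrak{m}$ as before, with $N$ in place of $n$. When $n$ is an even integer in
$(N, 2N]$, a variant of the method in §3.1.2 gives

$\int_{\mathfrak{M}} f(\alpha)^2 e(-\alpha n)\,d\alpha = \mathfrak{S}_2(n) \dfrac{n}{(\log n)^2} + O\left(\dfrac{N}{(\log N)^3}\right),$

where

$\mathfrak{S}_2(n) = \prod_{p \nmid n} \left(1 - \dfrac{1}{(p-1)^2}\right) \prod_{p \mid n} \left(\dfrac{p}{p-1}\right)$

is the singular series. In particular, we have $\mathfrak{S}_2(n) \ge 1$ for even $n$. Thus, for $n \in \mathcal{Z}(N)$, we
have

(3.27) $\left| \int_{\mathfrak{m}} f(\alpha)^2 e(-\alpha n)\,d\alpha \right| = \left| \int_{\mathfrak{M}} f(\alpha)^2 e(-\alpha n)\,d\alpha \right| \gg N (\log N)^{-2},$

whence

(3.28) $Z(N) \ll N^{-2} (\log N)^4 \sum_{n \in \mathcal{Z}(N)} \left| \int_{\mathfrak{m}} f(\alpha)^2 e(-\alpha n)\,d\alpha \right|^2.$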



On the other hand, by Bessel's inequality,

(3.29) $\sum_{n \in \mathcal{Z}(N)} \left| \int_{\mathfrak{m}} f(\alpha)^2 e(-\alpha n)\,d\alpha \right|^2 \le \int_{\mathfrak{m}} |f(\alpha)|^4\,d\alpha,$

and (3.16) and (3.18) yield

(3.30) $\int_{\mathfrak{m}} |f(\alpha)|^4\,d\alpha \le \left(\sup_{\mathfrak{m}} |f(\alpha)|\right)^2 \int_0^1 |f(\alpha)|^2\,d\alpha \ll N^3 P^{-1} (\log N)^5.$

Combining (3.28)-(3.30), we conclude that

$Z(N) \ll N P^{-1} (\log N)^9 \ll N (\log N)^{-A},$

on choosing, say, $P = (\log N)^{A+9}$. This completes the proof of (1.5).

3.3 The circle method in the Waring-Goldbach problem

We now turn our attention to Theorems 3 and 4. Much of the discussion in §3.1 can be
generalized to $k$th powers ($k \ge 2$). Using (3.1), we can write $R^*_{k,s}(n)$ as

$R^*_{k,s}(n) = \int_0^1 f(\alpha)^s e(-\alpha n)\,d\alpha,$

where now

$f(\alpha) = \sum_{p \le N} e(\alpha p^k), \qquad N = n^{1/k}.$

Define the sets of major and minor arcs as before (that is, by (3.4) and (3.5), with
$P = (\log N)^B$ and $B = B(k, s)$ to be chosen later). The machinery in §3.1.2 generalizes to $k$th
powers with little extra effort. The argument leading to (3.10) gives

(3.31) $f\left(\dfrac{a}{q} + \beta\right) = \phi(q)^{-1} S(q, a)\, v(\beta) + \text{error term},$

where $S(q, a)$ is defined by (1.13) and

$v(\beta) = \int_2^N \dfrac{e(\beta u^k)}{\log u}\,du.$

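We now raise (3.31) to the $s$th power and integrate the resulting approximation for $f(\alpha)^s$
over $\mathfrak{M}$. Using known estimates for $v(\beta)$ and $S(q, a)$, we find that when $s \ge k + 1$,

(3.32) $\int_{\mathfrak{M}} f(\alpha)^s e(-\alpha n)\,d\alpha = \mathfrak{S}^*_{k,s}(n) J^*_{k,s}(n) + O\left(N^{s-k} P^{-1/k+\varepsilon}\right),$

where $\mathfrak{S}^*_{k,s}(n)$ is defined by (1.13) and $J^*_{k,s}(n)$ is the singular integral

$J^*_{k,s}(n) = \int_{-\infty}^{\infty} v(\beta)^s e(-\beta n)\,d\beta = \dfrac{\Gamma^s(1/k)}{\Gamma(s/k)} \dfrac{n^{s/k-1}}{(\log n)^s} + O\left(n^{s/k-1} (\log n)^{-s-1}\right).$

This reduces the proof of Theorem 3 to the estimate

(3.33) $\int_{\mathfrak{m}} f(\alpha)^s e(-\alpha n)\,d\alpha \ll N^{s-k} (\log N)^{-s-1}.$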

Notice that when $k = 1$ and $s = 3$, (3.33) turns into (3.14). Thus, it is natural to try to
obtain variants of (3.16) and (3.17) for $f(\alpha)$ when $k \ge 2$. To estimate the maximum of $f(\alpha)$
on the minor arcs, we use the same tools as in §3.1.4, that is:

• Heath-Brown's or Vaughan's identity to reduce the estimation of $f(\alpha)$ to the estimation
of bilinear sums

$\sum_{X < x \le 2X} \sum_{\substack{Y < y \le 2Y \\ xy \le N}} \xi_x \eta_y\, e\left(\alpha (xy)^k\right);$

• Cauchy's inequality to bound those bilinear sums in terms of the quantity $T(\alpha)$
appearing in (3.24).

The following result due to Harman [73] is the analogue of Lemma 3.1 for $k \ge 2$.

Lemma 3.3. Let $k \ge 2$, let $\alpha \in \mathbb{R}$, and suppose that $a$ and $q$ are integers satisfying

$1 \le q \le N^k$, $(a, q) = 1$, $|q\alpha - a| < q^{-1}$.

There is a constant $c = c(k) > 0$ such that

$f(\alpha) \ll N (\log N)^c \left(q^{-1} + N^{-1/2} + q N^{-k}\right)^{4^{1-k}}.$

On choosing the constant $B$ (in the definition of $\mathfrak{m}$) sufficiently large, one can use Lemmas
3.2 and 3.3 to show that, for any fixed $A > 0$,

$\sup_{\alpha \in \mathfrak{m}} |f(\alpha)| \ll N (\log N)^{-A}.$

Hence, if $s = 2r + 1$, one has

$\int_{\mathfrak{m}} |f(\alpha)|^s\,d\alpha \le \sup_{\alpha \in \mathfrak{m}} |f(\alpha)| \int_0^1 |f(\alpha)|^{2r}\,d\alpha \ll N (\log N)^{-A} \int_0^1 |f(\alpha)|^{2r}\,d\alpha,$

and it suffices to establish the estimate

(3.34) $I_r(N) := \int_0^1 |f(\alpha)|^{2r}\,d\alpha \ll N^{2r-k} (\log N)^c,$

with $c = c(k, r)$.

3.4 Mean-value estimates for exponential sums 

We now turn to the proof of (3.34). By (3.1), $I_r(N)$ represents the number of solutions of
the diophantine equation

(3.35) $x_1^k + \cdots + x_r^k = x_{r+1}^k + \cdots + x_{2r}^k$, $1 \le x_1, \dots, x_{2r} \le N$,

in primes $x_1, \dots, x_{2r}$, and therefore, $I_r(N)$ does not exceed the number of solutions of (3.35)
in integers $x_1, \dots, x_{2r}$. Using (3.1) to write the latter quantity as a Fourier integral, we
conclude that

(3.36) $I_r(N) \le \int_0^1 |g(\alpha)|^{2r}\,d\alpha$, $g(\alpha) = \sum_{x \le N} e\left(\alpha x^k\right)$.

This reduces the estimation of the even moments of $f(\alpha)$ to the estimation of the respective
moments of the exponential sum $g(\alpha)$, whose analysis is much easier. In particular, we have
the following two results.

Lemma 3.4 (Hua's lemma). Suppose that $k \ge 1$, and let $g(\alpha)$ be defined by (3.36). There
exists a constant $c = c(k) > 0$ such that

(3.37) $\int_0^1 |g(\alpha)|^{2^k}\,d\alpha \ll N^{2^k - k} (\log N)^c.$

Lemma 3.5. Suppose that $k \ge 11$ and $g(\alpha)$ is defined by (3.36). There exists a constant
$c = c(k) > 0$ such that for $r > \frac{1}{2} k^2 (\log k + \log\log k + c)$,

(3.38) $\int_0^1 |g(\alpha)|^{2r}\,d\alpha \ll N^{2r-k}.$

These lemmas are, in fact, rather deep and important results in the theory of Waring's
problem. Unfortunately, their proofs are too complicated to include in this survey in any
meaningful way. The reader will find a proof of a somewhat weaker version of Hua's lemma
(with a factor of $N^{\varepsilon}$ in place of $(\log N)^c$) in [228, Lemma 2.5] and a complete proof in
[102, Theorem 4]. Results somewhat weaker than Lemma 3.5 are classical and go back to
Vinogradov's work on Waring's problem (see [102, Lemma 7.13] or [228, Theorem 7.4]).
Lemma 3.5 itself follows from the results in Ford [?] (in particular, see [?, (5.4)]).
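
The counting interpretation behind (3.35) and (3.36) can be made concrete for small parameters.
In the sketch below (our own illustration; the function names are hypothetical), the number of
solutions of (3.35) with variables in a given set is computed in two ways: by tabulating the sums
$x_1^k + \cdots + x_r^k$, and as the $2r$-th moment of the corresponding exponential sum. The
integral is replaced by an average over $M = rN^k + 1$ equally spaced points, which is exact here
because every frequency of the integrand has absolute value less than $M$.

```python
import cmath
from collections import Counter
from itertools import product

def count_solutions(values, k, r):
    """Solutions of x_1^k+...+x_r^k = x_{r+1}^k+...+x_{2r}^k with all x_i in `values`."""
    sums = Counter(sum(x ** k for x in xs) for xs in product(values, repeat=r))
    return sum(m * m for m in sums.values())

def moment_by_average(values, k, r):
    """Integral over [0, 1] of |sum of e(alpha*x^k)|^(2r), as an exact finite average."""
    M = r * max(values) ** k + 1
    total = 0.0
    for j in range(M):
        # x^k is reduced modulo M first; this does not change e(j*x^k/M).
        g = sum(cmath.exp(2j * cmath.pi * ((j * pow(x, k, M)) % M) / M) for x in values)
        total += abs(g) ** (2 * r)
    return total / M

if __name__ == "__main__":
    N, k, r = 12, 2, 2
    integers = list(range(1, N + 1))
    primes = [2, 3, 5, 7, 11]
    print(count_solutions(primes, k, r), count_solutions(integers, k, r),
          moment_by_average(integers, k, r))
```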

Combining (3.36) and Lemmas 3.4 and 3.5 we get (3.34) with

$r = \begin{cases} 2^{k-1}, & \text{if } k \le 10, \\ \left[\frac{1}{2} k^2 (\log k + \log\log k + c)\right] + 1, & \text{if } k \ge 11. \end{cases}$

Clearly, this completes the proof of Theorem 3, except for the case $6 \le k \le 8$, which we will
skip in order to avoid the discussion of certain technical details.



3.5 Diminishing ranges 

In this section, we describe the main new idea that leads to the bounds for $H(k)$ in Theorem
4. This idea, known as the method of diminishing ranges, appeared for the first time in the
work of Hardy and Littlewood on Waring's problem and later was developed into a powerful
technique by Davenport.

The limit of the method employed in §3.3 is set by the mean-value estimates in Lemmas
3.4 and 3.5. The key observation in the method of diminishing ranges is that it can be
much easier to count the solutions of the equation in (3.35) if the unknowns $x_1, \dots, x_{2r}$ are
restricted to proper subsets of $[1, N]$. For example, the simplest version of the method that
goes back to Hardy and Littlewood uses that when $N_2, \dots, N_r$ are defined recursively by

$N_j = k^{-1} N_{j-1}^{1-1/k}$ $(2 \le j \le r)$,

the equation

(3.35*) $x_1^k + \cdots + x_r^k = x_{r+1}^k + \cdots + x_{2r}^k$, $N_j < x_j, x_{r+j} \le 2N_j$ $(1 \le j \le r)$,

has only "diagonal" solutions with $x_{r+j} = x_j$, $j = 1, \dots, r$. Thus, the number of solutions of
(3.35*) is bounded above by

$N_1 \cdots N_r \ll N_1^{2-\lambda} (N_2 \cdots N_r)^2,$

where

$\lambda = 1 + \left(1 - \dfrac{1}{k}\right) + \cdots + \left(1 - \dfrac{1}{k}\right)^{r-1} \ge k - k e^{-r/k}.$

That is, we have the bound

(3.39) $\int_0^1 |g_1(\alpha) g_2(\alpha) \cdots g_r(\alpha)|^2\,d\alpha \ll N_1^{2-\lambda} (N_2 \cdots N_r)^2,$

where

$g_j(\alpha) = \sum_{N_j < x \le 2N_j} e\left(\alpha x^k\right)$ $(1 \le j \le r)$.
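
The claim that the restricted equation has only diagonal solutions can be confirmed by exhaustive
search in small cases. The sketch below is an illustration under our own choice of parameters
(the function names are hypothetical): it builds the ranges $N_j < x \le 2N_j$ with
$N_j = k^{-1} N_{j-1}^{1-1/k}$, tabulates the sums $x_1^k + \cdots + x_r^k$, and compares the
total number of solutions with the number $N_1 \cdots N_r$ of diagonal ones (up to rounding of the
range lengths).

```python
from collections import Counter
from itertools import product
from math import floor

def diminishing_ranges(N1, k, r):
    """Integer ranges N_j < x <= 2*N_j, where N_j = N_{j-1}^(1 - 1/k) / k for j >= 2."""
    Ns = [float(N1)]
    for _ in range(r - 1):
        Ns.append(Ns[-1] ** (1 - 1 / k) / k)
    return [range(floor(N) + 1, floor(2 * N) + 1) for N in Ns]

def count_solutions(N1, k, r):
    """Return (all solutions, diagonal solutions) of the restricted equation."""
    rs = diminishing_ranges(N1, k, r)
    sums = Counter(sum(x ** k for x in xs) for xs in product(*rs))
    total = sum(m * m for m in sums.values())  # pairs of r-tuples with equal sums
    diagonal = 1
    for rng in rs:
        diagonal *= len(rng)
    return total, diagonal

if __name__ == "__main__":
    print(count_solutions(400, 2, 2))
    print(count_solutions(1000, 3, 3))
```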

We can use (3.39) as a replacement for the mean-value estimates in §3.4. Let $T_{k,s}(n)$
denote the number of solutions of

$p_1^k + p_2^k + \cdots + p_s^k = n$

in primes $p_1, \dots, p_s$ subject to

$N_j < p_j, p_{r+j} \le 2N_j$ $(1 \le j \le r)$, $N_1 < p_{2r+1}, \dots, p_s \le 2N_1$.

Then

(3.40) $T_{k,s}(n) = \int_0^1 f_1(\alpha)^{s-2r+2} f_2(\alpha)^2 \cdots f_r(\alpha)^2\, e(-\alpha n)\,d\alpha,$

where

$f_j(\alpha) = \sum_{N_j < p \le 2N_j} e\left(\alpha p^k\right)$ $(1 \le j \le r)$.

When $r \sim ck \log k$, we can use (3.39) to derive a bound of the form

$\int_0^1 \left| f_1(\alpha)^2 f_2(\alpha) \cdots f_r(\alpha) \right|^2\,d\alpha \ll N_1^{4-k} (N_2 \cdots N_r)^2.$

Furthermore, assuming that $s$ is just slightly larger than $2r$ (it suffices to assume that
$s \ge 2r + 3$, for example), we can then obtain an asymptotic formula for the right side of
(3.40) by the methods sketched in §3.3. This is (essentially) how one proves Theorem 4 for
$k \ge 11$. The proof for $k \le 10$ follows the same general approach, except that we use more
elaborate choices of the parameters $N_1, \dots, N_r$ in (3.35*).

3.6 Kloosterman's refinement of the circle method



Consider again equation (1.9) with $k = 2$. The Hardy-Littlewood method in its original
form establishes the asymptotic formula (1.10) for $s \ge 5$, but it fails to prove Lagrange's
four squares theorem. In 1926 Kloosterman [122] proposed a variant of the circle method,
known today as Kloosterman's refinement, which he used to prove an asymptotic formula
for the number of solutions of the equation

(3.41) $a_1 x_1^2 + \cdots + a_4 x_4^2 = n,$

where $a_j$ are fixed positive integers.

Denote by $I(n)$ the number of solutions of (3.41) in positive integers $x_j$. By (3.1),

(3.42) $I(n) = \int_0^1 H(\alpha) e(-\alpha n)\,d\alpha,$

where

$H(\alpha) = h(a_1 \alpha) \cdots h(a_4 \alpha)$, $h(\alpha) = \sum_{x \le N} e\left(\alpha x^2\right)$, $N = n^{1/2}$.

A "classical" Hardy-Littlewood decomposition of the right side of (3.42) into integrals over
major and minor arcs is of little use here, since we cannot prove that the contribution from
the minor arcs is smaller than the expected main term. Kloosterman's idea is to eliminate
the minor arcs altogether.

The elimination of the minor arcs requires greater care in the handling of the major arcs.
Let $X$ be the integer with $X - 1 < N \le X$. It is clear that the integration in (3.42) can
be taken over the interval $(X^{-1}, 1 + X^{-1}]$, which can be represented as a union of disjoint
intervals

(3.43) $(X^{-1}, 1 + X^{-1}] = \bigcup_{q \le N} \bigcup_{\substack{1 \le a \le q \\ (a,q)=1}} \left(\dfrac{a}{q} - \dfrac{1}{q q_1},\ \dfrac{a}{q} + \dfrac{1}{q q_2}\right],$

where for each pair $q, a$ in the union, the positive integers $q_1 = q_1(q, a)$ and $q_2 = q_2(q, a)$ are
uniquely determined and satisfy the conditions

(3.44) $N < q_1, q_2 \le 2N$, $a q_1 \equiv 1 \pmod{q}$, $a q_2 \equiv -1 \pmod{q}$.

The decomposition (3.43) is known as the Farey decomposition and provides a natural way
of partitioning the unit interval into non-overlapping major arcs (see Hardy and Wright
[74, Section 3.8]). Let $\mathfrak{M}(q, a)$ denote the interval in the Farey decomposition "centered" at
$a/q$. We have

(3.45) $I(n) = \sum_{q \le N} \sum_{\substack{1 \le a \le q \\ (a,q)=1}} e\left(\dfrac{-an}{q}\right) \int_{\mathfrak{B}(q,a)} H\left(\dfrac{a}{q} + \beta\right) e(-\beta n)\,d\beta,$

where $\mathfrak{B}(q, a)$ is defined by

(3.46) $\mathfrak{B}(q, a) = \{\beta \in \mathbb{R} : a/q + \beta \in \mathfrak{M}(q, a)\}.$

We can find an asymptotic formula for the integrand on the right side of (3.45). The
contribution of the main term in that asymptotic formula produces the expected main term
in the asymptotic formula for $I(n)$. However, in order to obtain a satisfactory bound for the
contribution of the error term, we have to take into account the cancellation among terms
corresponding to different Farey fractions $a/q$ with the same denominator. To this end, we
want to interchange the order of integration and summation over $a$ in (3.45). Since the
endpoints of $\mathfrak{B}(q, a)$ depend on $a$, the total contribution of the error terms can be expressed
as

(3.47) $\sum_{q \le N} \int_{-1/(qN)}^{1/(qN)} \Bigg( {\sum_{\substack{1 \le a \le q \\ (a,q)=1}}}' E\left(\dfrac{a}{q} + \beta\right) e\left(\dfrac{-an}{q}\right) \Bigg) e(-\beta n)\,d\beta,$

where $E(a/q + \beta)$ is the error term in the major arc approximation to $H(a/q + \beta)$ and
the superscript in $\sum'$ indicates that the summation is restricted to those $a$ for which
$\mathfrak{B}(q, a) \ni \beta$. Using (3.44) and (3.46), we can transform the latter constraint on $a$ into a
condition about the multiplicative inverse of $a$ modulo $q$, that is, the unique residue class $\bar{a}$
modulo $q$ with $a\bar{a} \equiv 1 \pmod{q}$. Thus, a special kind of exponential sums enter the scene:
the Kloosterman sums

$K(q; m, n) = \sum_{\substack{x = 1 \\ (x,q)=1}}^{q} e\left(\dfrac{mx + n\bar{x}}{q}\right).$

There are also other (in fact, more substantial) reasons for the Kloosterman sums to appear,
but those are too technical to include here.

The success of Kloosterman's method hinges on the existence of sufficiently sharp
estimates for $K(q; m, n)$. The first such estimate was found by Kloosterman himself and later
his result has been improved. Today it is known that

(3.48) $|K(q; m, n)| \le \tau(q)\, q^{1/2} (m, n, q)^{1/2},$

where $(m, n, q)$ is the greatest common divisor of $m, n, q$ and $\tau(q)$ is the number of positive
divisors of $q$. In 1948 A. Weil [243] proved (3.48) in the most important case: when $q$ is a
prime. In the general case (3.48) was established by Estermann. This estimate plays an
important role not only in the Kloosterman refinement of the circle method, but in many
other problems in number theory.
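
Kloosterman sums are simple to compute for small moduli, which makes it possible to observe the
square-root cancellation expressed by (3.48). The sketch below is illustrative only (the function
names are ours); it uses Python's built-in `pow(x, -1, q)` (available from Python 3.8) for the
inverse of $x$ modulo $q$ and compares $|K(q; m, n)|$ with the right side of (3.48).

```python
import cmath
from math import gcd, sqrt

def kloosterman(q, m, n):
    """K(q; m, n): the sum of e((m*x + n*xbar)/q) over 1 <= x <= q with (x, q) = 1."""
    total = 0j
    for x in range(1, q + 1):
        if gcd(x, q) == 1:
            xbar = pow(x, -1, q)  # inverse of x modulo q
            total += cmath.exp(2j * cmath.pi * ((m * x + n * xbar) % q) / q)
    return total

def divisor_count(q):
    """tau(q): the number of positive divisors of q."""
    return sum(1 for d in range(1, q + 1) if q % d == 0)

def bound_348(q, m, n):
    """Right side of (3.48)."""
    return divisor_count(q) * sqrt(q) * sqrt(gcd(gcd(m, n), q))

if __name__ == "__main__":
    for q in (7, 12, 101):
        print(q, kloosterman(q, 1, 1).real, bound_348(q, 1, 1))
```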

Kloosterman's method has been applied to several additive problems, and in particular,
to problems with primes and almost primes. We refer the reader, for example, to Estermann
[?], Hooley [?], Heath-Brown [?], Brüdern and Fouvry [21], Heath-Brown
and Tolev [?].

4 Sieve methods



In this section we describe the so-called sieve methods, which are an important tool in
analytic number theory and, in particular, in the proof of Chen's theorem (Theorem 2 in the
Introduction). We start with a brief account of the main idea of the method (§4.1 and §4.2).
This allows us in §4.3 to present a proof of a slightly weaker (but much simpler) version of
Chen's result, in which $P_2$-numbers are replaced by $P_4$-numbers. We conclude the section by
sketching some of the new ideas needed to obtain Chen's theorem in its full strength (§4.4)
and some further work on sieve methods (§4.5).

4.1 The sieve of Eratosthenes 

Let $\mathcal{A}$ be a finite integer sequence. We will be concerned with the existence of elements of
$\mathcal{A}$ that are primes or, more generally, almost primes $P_r$, with $r$ bounded. In general, sieve
methods reduce such a question to counting the elements $a \in \mathcal{A}$ not divisible by small primes
$p$ from some suitably chosen set of primes $\mathfrak{P}$. To be more explicit, we consider a set of prime
numbers $\mathfrak{P}$ and a real parameter $z \ge 2$ and define the sifting function

(4.1) $S(\mathcal{A}, \mathfrak{P}, z) = |\{a \in \mathcal{A} : (a, P(z)) = 1\}|$, $P(z) = \prod_{\substack{p < z \\ p \in \mathfrak{P}}} p$,

where $|\mathcal{A}|$ denotes the number of elements of a sequence $\mathcal{A}$ (not the cardinality of the
underlying set). In applications, the set $\mathfrak{P}$ is usually taken to be the set of possible prime
divisors of the elements of $\mathcal{A}$, so the sifting function (4.1) counts the elements of $\mathcal{A}$ free of
prime divisors $p < z$.

In order to bound $S(\mathcal{A}, \mathfrak{P}, z)$, we recall the following fundamental property of the Möbius
function (see [74, Theorem 263]):

(4.2) $\sum_{d \mid n} \mu(d) = \begin{cases} 1, & \text{if } n = 1, \\ 0, & \text{if } n > 1. \end{cases}$

Using this identity, we can express the sifting function in the form

(4.3) $S(\mathcal{A}, \mathfrak{P}, z) = \sum_{a \in \mathcal{A}} \sum_{d \mid (a, P(z))} \mu(d).$

We can now interchange the order of summation to get

(4.4) $S(\mathcal{A}, \mathfrak{P}, z) = \sum_{d \mid P(z)} \mu(d) |\mathcal{A}_d|,$

where

$\mathcal{A}_d = \{a \in \mathcal{A} : a \equiv 0 \pmod{d}\}.$

To evaluate the right side of (4.4), we suppose that there exist a (large) parameter $X$ and a
multiplicative function $\omega(d)$ such that $|\mathcal{A}_d|$ can be approximated by $X\omega(d)/d$. We write
$r(X, d)$ for the error term in this approximation, that is,

(4.5) $|\mathcal{A}_d| = X \dfrac{\omega(d)}{d} + r(X, d).$

We expect $r(X, d)$ to be 'small', at least in some average sense over $d$. Substituting (4.5)
into the right side of (4.4), we find that

(4.6) $S(\mathcal{A}, \mathfrak{P}, z) = X V(z) + R(X, z),$

where

(4.7) $V(z) = \sum_{d \mid P(z)} \mu(d) \dfrac{\omega(d)}{d}$, $R(X, z) = \sum_{d \mid P(z)} \mu(d)\, r(X, d).$

We would like to believe that, under 'ideal circumstances', (4.6) is an asymptotic formula
for the sifting function $S(\mathcal{A}, \mathfrak{P}, z)$, $XV(z)$ being the main term and $R(X, z)$ the error term.
However, such expectations turn out to be unrealistic, as we are about to demonstrate.

Let us try to apply (4.6) to bound above the number of primes $\le x$. We choose

(4.8) $\mathcal{A} = \{n \in \mathbb{N} : n \le x\}$, $\mathfrak{P} = \{p : p \text{ is a prime}\}$.

Then

$|\mathcal{A}_d| = \left[\dfrac{x}{d}\right] = \dfrac{x}{d} + r(x, d)$, $|r(x, d)| \le 1$,

that is, $X = x$ and $\omega(d) = 1$ for all $d$. Using an elementary property of multiplicative
functions (see [74, Theorem 286]), we can write $V(z)$ as

(4.9) $V(z) = \prod_{p < z} \left(1 - \dfrac{\omega(p)}{p}\right).$

When $\omega(p) = 1$, this identity and an asymptotic formula due to Mertens (see [74, Theorem
429]) reveal that the main term in (4.6) is

(4.10) $X V(z) = X \prod_{p < z} \left(1 - \dfrac{1}{p}\right) \sim X \dfrac{e^{-\gamma}}{\log z}$ as $z \to \infty$;

here $\gamma = 0.5772\ldots$ is Euler's constant. Thus, if $\mathcal{A}$ and $\mathfrak{P}$ are as in (4.8) and $z = x^{1/2}$, the
projected 'main term' in (4.6) is $\sim 2e^{-\gamma} x (\log x)^{-1}$ as $x \to \infty$, whereas the true size of the
sifting function on the left side is

$S(\mathcal{A}, \mathfrak{P}, \sqrt{x}) = \pi(x) - \pi(\sqrt{x}) + 1 \sim \dfrac{x}{\log x}$ as $x \to \infty$,

by the Prime Number Theorem. Since $2e^{-\gamma} = 1.122\ldots$, we conclude that the 'error term'
$R(x, \sqrt{x})$ is in this case of the same order of magnitude as the 'main term'.

Identity (4.6) is known as the sieve of Eratosthenes-Legendre. The basic idea goes back
to the ancient Greeks (it is usually attributed to Eratosthenes), while the formal exposition above
is essentially due to Legendre, who used the above argument to show that

$\pi(x) \ll \frac{x}{\log\log x}.$

The sieve of Eratosthenes-Legendre can be extremely powerful in certain situations (see footnote 7), but in
most cases the sum $R(X, z)$ contains 'too many' terms for (4.6) to be of any practical use
(e.g., in the above example, $R(X, z)$ contains $2^{\pi(\sqrt{x})}$ terms). Modern sieve methods use various
clever approximations to the left side of (4.2) to overcome this problem. In the following
sections, we describe one variant of one of the existing approaches. The reader can find
other constructions, comparisons of the various approaches, and proofs in the monographs
on sieve methods |53*1 fl74j or in (§0] (see also the remarks in A4.5I for other references). 
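
The Eratosthenes-Legendre identity can be checked numerically. The following is a minimal sketch, assuming the choice (4.8) with the illustrative values x = 400 and z = 20 (so z = x^{1/2}); the function names are hypothetical and not taken from the text. It evaluates the Legendre sum over all squarefree d dividing P(z), compares it with a direct count, and prints the remainder R(X, z) together with the number of terms the sum contains.

```python
from itertools import combinations
from math import isqrt, prod


def primes_below(z):
    """Return all primes p < z by trial division (adequate for small z)."""
    return [p for p in range(2, z)
            if all(p % q for q in range(2, isqrt(p) + 1))]


def legendre_sum(x, z):
    """Evaluate sum over d | P(z) of mu(d) * floor(x / d) for A = {n <= x}
    and P = all primes. Also return the number of terms in the sum."""
    primes = primes_below(z)
    total, terms = 0, 0
    for r in range(len(primes) + 1):
        for subset in combinations(primes, r):
            d = prod(subset)                # squarefree divisor of P(z); empty product is 1
            total += (-1) ** r * (x // d)   # mu(d) = (-1)^r when d has r prime factors
            terms += 1
    return total, terms


def direct_count(x, z):
    """Count n <= x with no prime divisor p < z, straight from the definition."""
    primes = primes_below(z)
    return sum(1 for n in range(1, x + 1) if all(n % p for p in primes))


x, z = 400, 20                      # illustrative values with z = sqrt(x)
sifted, terms = legendre_sum(x, z)
main_term = x * prod(1 - 1 / p for p in primes_below(z))   # X V(z) with omega = 1
remainder = sifted - main_term                              # R(X, z)
print(sifted, direct_count(x, z), terms, round(main_term, 2), round(remainder, 2))
```

The identity itself is exact, so the two counts agree. The difficulty lies in the size of the sum: there are 2^8 terms here, and the number doubles with every additional sieving prime.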



4.2 The linear sieve 

Let $y > 0$ be a parameter to be chosen later in terms of $X$ and suppose that $\lambda^+(d)$ and
$\lambda^-(d)$ are real-valued functions supported on the squarefree integers $d$ (i.e., $\lambda^\pm(d) = 0$ if $d$ is
divisible by the square of a prime). Furthermore, suppose that

(4.11) $|\lambda^\pm(d)| \le 1$ and $\lambda^\pm(d) = 0$ for $d \ge y$,

and that

(4.12) $\sum_{d \mid n} \lambda^-(d) \le \sum_{d \mid n} \mu(d) \le \sum_{d \mid n} \lambda^+(d)$ for all $n = 1, 2, \ldots$.

Using (4.3) and the left inequality in (4.12), we obtain

$S(\mathcal{A}, \mathfrak{P}, z) \ge \sum_{a \in \mathcal{A}} \sum_{d \mid (a, P(z))} \lambda^-(d).$

We can interchange the order of summation in the right side of this inequality and apply
(4.5) and (4.11) to get the bound




$S(\mathcal{A}, \mathfrak{P}, z) \ge \sum_{d \mid P(z)} \lambda^-(d) |\mathcal{A}_d| \ge X\mathcal{M}^- - \mathcal{R},$

where

(4.13) $\mathcal{M}^\pm = \sum_{d \mid P(z)} \lambda^\pm(d)\frac{\omega(d)}{d}, \qquad \mathcal{R} = \sum_{d \mid P(z),\ d < y} |r(X, d)|.$

In a similar fashion, we can use the right inequality in (4.12) to estimate the sifting function
from above. That is, we have

(4.14) $X\mathcal{M}^- - \mathcal{R} \le S(\mathcal{A}, \mathfrak{P}, z) \le X\mathcal{M}^+ + \mathcal{R}.$

Footnote 7: For example, I. M. Vinogradov's combinatorial argument for converting sums over primes into linear
combinations of type I and type II sums is based on a variant of (4.6). See Harman [79] for other applications
and further discussion.

We are now in a position to overcome the difficulty caused by the "error term" in the
Eratosthenes-Legendre sieve. The sum $\mathcal{R}$ is similar to the error term $R(X, z)$ defined in
(4.7), but unlike $R(X, z)$ we can use the parameter $y$ to control the number of terms in
$\mathcal{R}$. Thus, our general strategy will be to construct functions $\lambda^\pm(d)$ which satisfy (4.11) and
(4.12) and for which the sums $\mathcal{M}^\pm$ are of the same order as the sum $V(z)$ defined in (4.7).
There are various constructions of such functions $\lambda^\pm(d)$. However, since it is not our goal to
give a detailed treatment of sieve theory here, we will simply state one of the modern sieves
in a form suitable for an application to the binary Goldbach problem.

The sieve method we will use is known as the Rosser-Iwaniec sieve. Its idea appeared
for the first time in an unpublished manuscript by Rosser. The full-fledged version of this
sieve was developed independently by Iwaniec [105, 106]. Suppose that the multiplicative
function $\omega$ in (4.5) satisfies the condition

(4.15) $\prod_{w_1 \le p < w_2} \left(1 - \frac{\omega(p)}{p}\right)^{-1} \le \left(\frac{\log w_2}{\log w_1}\right)^{\kappa} \left(1 + \frac{K}{\log w_1}\right) \qquad (2 \le w_1 < w_2),$

where $\kappa > 0$ is an absolute constant known as the sieve dimension and $K > 0$ is independent
of $w_1$ and $w_2$. This inequality is usually interpreted as an average bound for the values taken
by $\omega(p)$ when $p$ is prime, since it is consistent with the inequality $\omega(p) \le \kappa$. In our application
of the sieve to Goldbach's problem, we will have to deal with a sequence $\mathcal{A}$ (given by (1.7))
for which (4.15) holds with $\kappa = 1$, so we will state the Rosser-Iwaniec sieve in this special
case, in which it is known as the linear sieve.

Suppose that $\omega(p)$ satisfies (4.15) with $\kappa = 1$ and that

(4.16) $0 < \omega(p) < p$ when $p \in \mathfrak{P}$ and $\omega(p) = 0$ when $p \notin \mathfrak{P}$.

We put $\lambda^\pm(1) = 1$ and $\lambda^\pm(d) = 0$ if $d$ is not squarefree. If $d > 1$ is squarefree and has prime
decomposition $d = p_1 \cdots p_r$, $p_1 > p_2 > \cdots > p_r$, we define

(4.17) $\lambda^+(d) = \begin{cases} (-1)^r & \text{if } p_1 \cdots p_{2l} p_{2l+1}^3 < y \text{ whenever } 0 \le l \le (r-1)/2, \\ 0 & \text{otherwise,} \end{cases}$

(4.18) $\lambda^-(d) = \begin{cases} (-1)^r & \text{if } p_1 \cdots p_{2l-1} p_{2l}^3 < y \text{ whenever } 1 \le l \le r/2, \\ 0 & \text{otherwise.} \end{cases}$

It can be shown (see [63, 105]) that these two functions satisfy conditions (4.11) and (4.12).
Furthermore, if the quantities $\mathcal{M}^\pm$ are defined by (4.13) with $\lambda^\pm(d)$ given by (4.17) and
(4.18), we have

(4.19) $V(z) \le \mathcal{M}^+ \le V(z)\left(F(s) + O\left(e^{-s}(\log y)^{-1/3}\right)\right)$ for $s \ge 1$,

(4.20) $V(z) \ge \mathcal{M}^- \ge V(z)\left(f(s) + O\left(e^{-s}(\log y)^{-1/3}\right)\right)$ for $s \ge 2$,

where $s = \log y/\log z$ and the functions $f(s)$ and $F(s)$ are the continuous solutions of a
system of differential delay equations (see [63, 105]). The analysis of that system reveals
that the function $F(s)$ is strictly decreasing for $s > 0$, that the function $f(s)$ is strictly
increasing for $s > 2$, and that

(4.21) $0 < f(s) < 1 < F(s)$ for $s > 2$.

Furthermore, both functions are very close to 1 for large $s$. More precisely, they satisfy

(4.22) $F(s), f(s) = 1 + O(s^{-s})$ as $s \to \infty$.

Substituting (4.19) and (4.20) into (4.14), we obtain

(4.23) $S(\mathcal{A}, \mathfrak{P}, z) \le XV(z)\left(F(s) + O\left((\log y)^{-1/3}\right)\right) + \mathcal{R}$ for $s \ge 1$,

(4.24) $S(\mathcal{A}, \mathfrak{P}, z) \ge XV(z)\left(f(s) + O\left((\log y)^{-1/3}\right)\right) - \mathcal{R}$ for $s \ge 2$,

where $\mathcal{R}$ is defined by (4.13).
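
The combinatorial weights of the linear sieve are easy to experiment with. The sketch below is an illustration only: it assumes the truncation rule that the upper weight tests the products p_1 ... p_{m-1} p_m^3 < y at odd positions m and the lower weight tests them at even positions m, and it checks the sandwich inequality (4.12) for small n. The function names and the value y = 50 are hypothetical choices made for this example.

```python
from math import prod


def prime_factors(n):
    """Prime factors of n with multiplicity, by trial division."""
    factors, q = [], 2
    while q * q <= n:
        while n % q == 0:
            factors.append(q)
            n //= q
        q += 1
    if n > 1:
        factors.append(n)
    return factors


def rosser_weight(d, y, upper):
    """Upper weight lambda^+(d) if upper is True, lower weight lambda^-(d) otherwise."""
    ps = prime_factors(d)
    if len(set(ps)) != len(ps):
        return 0                          # d is not squarefree
    ps.sort(reverse=True)                 # p_1 > p_2 > ... > p_r
    r = len(ps)
    for m in range(1, r + 1):
        # the upper weight constrains odd m, the lower weight constrains even m
        if (m % 2 == 1) == upper:
            if prod(ps[:m - 1]) * ps[m - 1] ** 3 >= y:
                return 0
    return (-1) ** r                      # equals mu(d) when every test passes


def divisor_sum(n, y, upper):
    """Sum of the chosen weight over all divisors d of n."""
    return sum(rosser_weight(d, y, upper)
               for d in range(1, n + 1) if n % d == 0)


y = 50
triples = [(divisor_sum(n, y, False), int(n == 1), divisor_sum(n, y, True))
           for n in range(1, 1001)]
print(all(low <= mid <= high for low, mid, high in triples))
```

The middle entry of each triple is the Möbius divisor sum, which is 1 for n = 1 and 0 otherwise, so the check confirms that the two truncated sums bracket it from below and above.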

We now return to our initial goal: to prove that the sequence $\mathcal{A}$ contains almost
primes. We want to use (4.24) to show that

(4.25) $S(\mathcal{A}, \mathfrak{P}, X^{\alpha}) > 0$

for some fixed $\alpha > 0$. This will imply the existence of an $a \in \mathcal{A}$ all of whose prime divisors
exceed $X^{\alpha}$. If $|a| \le X^{g}$ for all $a \in \mathcal{A}$, it will then follow that $\mathcal{A}$ contains a $P_r$-number, where
$r \le g/\alpha$. Clearly, since we want to minimize $r$, we would like to take $\alpha$ as large as possible.
In order to derive (4.25) from (4.24), we need to ensure that the main
term in (4.24) is positive and that the error term $\mathcal{R}$ is of a smaller order of magnitude than
the main term. It is the balancing of these two requirements that determines the optimal
choice for $z$ and, ultimately, the quality of our result. On the one hand, in view of (4.21), the positivity of the
main term in (4.24) requires choosing $y$ slightly larger than $z^2$. On the other hand, while in
some applications the estimation of $\mathcal{R}$ is easier than in others, it is always the case that it
imposes a restriction on how large we can choose $y$, and hence, how large we can choose $z$.
In the next section, we demonstrate how this general approach works when applied to the
binary Goldbach problem.

4.3 The linear sieve in the binary Goldbach problem

In this section, we apply the linear Rosser-Iwaniec sieve to the sequence $\mathcal{A}$ in (1.7) and the
set $\mathfrak{P}$ of odd primes that do not divide $n$, that is,

$\mathcal{A} = \mathcal{A}(n) = \{n - p : 2 < p < n\}$ and $\mathfrak{P} = \{p : p > 2,\ p \nmid n\}.$

It is clear that all elements of $\mathcal{A}$ are odd numbers and that at most $\log n$ of them may have
a common prime factor with $n$ (for $(n, n - p) > 1$ implies $p \mid n$, and $n$ has at most $\log n$ odd
prime factors). Thus, $\mathfrak{P}$ is the set of "typical" prime divisors of elements of $\mathcal{A}$.

Next, we proceed to define the quantity $X$ and the multiplicative function $\omega(d)$ in (4.5).
We have

(4.26) $|\mathcal{A}_d| = \sum_{2 < p < n,\ p \equiv n \ (\mathrm{mod}\ d)} 1 = \pi(n; d, n) + O(1),$

so the prime number theorem for arithmetic progressions suggests the choice

(4.27) $X = \operatorname{li} n$ and $\omega(d) = \begin{cases} d/\phi(d) & \text{if } (d, n) = 1, \\ 0 & \text{otherwise.} \end{cases}$

With this choice, the error terms $r(X, d)$ defined by (4.5) satisfy the inequality

$|r(X, d)| \le \begin{cases} \left|\pi(n; d, n) - \dfrac{\operatorname{li} n}{\phi(d)}\right| + 1 & \text{if } (d, n) = 1, \\ 1 & \text{otherwise.} \end{cases}$

It then follows from the Bombieri-Vinogradov theorem that

(4.28) $\mathcal{R} \le \sum_{d < y} \left( \max_{(a, d) = 1} \left|\pi(n; d, a) - \frac{\operatorname{li} n}{\phi(d)}\right| + 1 \right) \ll n(\log n)^{-3}$

whenever $y \le n^{1/2}(\log n)^{-6}$. Furthermore, we have

(4.29) $V(z) = \prod_{p < z,\ p \in \mathfrak{P}} \left(1 - \frac{1}{p - 1}\right) \ge \prod_{2 < p < z} \left(1 - \frac{1}{p - 1}\right) \gg (\log z)^{-1}.$

On choosing $y = n^{1/2}(\log n)^{-6}$ and $z = n^{2/9}$, we have

$\frac{\log y}{\log z} = \frac{9}{4} + O\left(\frac{\log\log n}{\log n}\right) > 2.2,$

provided that $n$ is sufficiently large. Hence, we deduce from (4.21), (4.24) and (4.27)-(4.29)
that

(4.30) $S(\mathcal{A}, \mathfrak{P}, z) \gg n(\log n)^{-2}.$

That is, there are $\gg n(\log n)^{-2}$ elements of $\mathcal{A}$ that have no prime divisors smaller than $n^{2/9}$.
Since the numbers in $\mathcal{A}$ do not exceed $n$, the elements of $\mathcal{A}$ counted on the left side of (4.30)
have at most four prime divisors each, that is, the left side of (4.30) counts solutions of
$n - p = P_4$.

We have some freedom in our choice of parameters in the above argument. For example,
we could have set $z = n^{\alpha}$, where $\alpha$ is any fixed real number in the range $1/5 \le \alpha < 1/4$. Of
course, what we would really like to do is set $z = n^{\alpha}$, where $\alpha \ge 1/4$. With such a choice for $z$,
the above argument would establish the existence of solutions to $n - p = P_3$ for every sufficiently large even $n$.
Unfortunately, our choice of $z$ is restricted (via the condition $s = \log y/\log z > 2$) by the
largest value of $Q$ admissible in the Bombieri-Vinogradov theorem. In particular, in order
to be able to choose $z = n^{1/4}$, we would need a version of the Bombieri-Vinogradov theorem
that holds for $Q \le x^{1/2+\varepsilon}$.
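
The counting step at the end of this argument can be illustrated by brute force. The following is a minimal sketch, assuming the parameter choice z = n^{2/9} and the illustrative value n = 10000; it does not use the sieve bounds at all, and the function names are hypothetical. It lists the elements n - p that survive sifting by the odd primes below z not dividing n, and confirms that those coprime to n have at most four prime factors.

```python
from math import gcd


def prime_factors(m):
    """Prime factors of m with multiplicity, by trial division."""
    factors, q = [], 2
    while q * q <= m:
        while m % q == 0:
            factors.append(q)
            m //= q
        q += 1
    if m > 1:
        factors.append(m)
    return factors


def sifted_set(n, alpha=2 / 9):
    """Elements a = n - p (p prime, 2 < p < n) with no prime divisor q < n^alpha
    among the odd primes q that do not divide n."""
    z = n ** alpha
    sieve_primes = [q for q in range(3, int(z) + 1)
                    if q < z and len(prime_factors(q)) == 1 and n % q != 0]
    primes = [p for p in range(3, n) if len(prime_factors(p)) == 1]
    return [n - p for p in primes if all((n - p) % q for q in sieve_primes)]


n = 10_000
survivors = sifted_set(n)
# Set aside the few elements sharing a prime factor with n, as in the text.
typical = [a for a in survivors if gcd(a, n) == 1]
omega_max = max(len(prime_factors(a)) for a in typical)
print(len(survivors), len(typical), omega_max)
```

Every element of `typical` is below n and has all of its prime factors at least n^{2/9}, so five such factors would already exceed n; this is the reason the bound of four holds.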

4.4 Weighted sieves and Chen's theorem 

The idea of a weighted sieve was introduced by Kuhn [124], who observed that instead of the
sifting function $S(\mathcal{A}, \mathfrak{P}, z)$ one may consider a more general sum of the type

(4.31) $W(\mathcal{A}, \mathfrak{P}, z) = \sum_{a \in \mathcal{A},\ (a, P(z)) = 1} w(a),$

where $w(a)$ are weights at one's disposal to choose. It is common to use weights of the form

(4.32) $w(a) = 1 - \sum_{p \mid a,\ z \le p < z_1} \omega_p,$

with suitably chosen $0 \le \omega_p \le 1$. With such a choice of $w(a)$, (4.31) can be written in the
form

(4.33) $W(\mathcal{A}, \mathfrak{P}, z) = S(\mathcal{A}, \mathfrak{P}, z) - \sum_{z \le p < z_1} \omega_p S(\mathcal{A}_p, \mathfrak{P}, z).$

We can now use an ordinary sieve to estimate the right side of (4.33). For example, we
can appeal to (4.24) to bound $S(\mathcal{A}, \mathfrak{P}, z)$ from below and to (4.23) to bound each sifting
function $S(\mathcal{A}_p, \mathfrak{P}, z)$ from above. If the resulting lower bound for the right side of (4.33) is
positive, we then conclude that there exist elements $a$ of $\mathcal{A}$ with $w(a) > 0$. Such numbers $a$
have no prime divisors $p < z$ and the number of their prime divisors with $z \le p < z_1$ can be
controlled via the choice of the $\omega_p$'s.

The above idea plays an important role in improvements on the result established in §4.3.
Using weighted sieves, Buchstab [2H] and Richert [196] proved that every sufficiently large
even $n$ can be represented as the sum of a prime and a $P_3$-number. Richert used weights of
the form 

$p \mid a$
$z \le p < z_1$

while Buchstab's weights were somewhat more complicated. Chen's proof of Theorem 2 uses
weights of the form (4.32) with $z = n^{1/10}$, $z_1 = n^{1/3}$, and

$\omega_p = \frac{1}{2} + \frac{1}{2}\delta_p(a),$

where

$\delta_p(a) = \begin{cases} 1 & \text{if } a = p p_1 p_2 \text{ with } p_1, p_2 \ge z_1, \\ 0 & \text{otherwise.} \end{cases}$

Here, $n$ is the even number appearing in the statement of Theorem 2 and $a$ is an element of
the sequence (1.7). With this choice of $\omega_p$, successful sifting produces numbers $a \in \mathcal{A}$ with
$w(a) > 0$ and no prime divisors $p < n^{1/10}$. One can prove that any such number $a$ must in
fact be a $P_2$-number. The reader can find a detailed proof of Chen's theorem in [66, Chapter
11], HZSl Chapter 10], or [T7E1 Chapter 9].
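
The last claim can be illustrated on a small example. The sketch below is a simplified model rather than Chen's proof: it assumes weights of the shape w(a) = 1 minus the sum of (1/2 + delta/2) over the prime factors of a in [z, z_1), with z = n^{1/10} and z_1 = n^{1/3}, where delta = 1 exactly when a is such a prime times two primes that are at least z_1. As a further simplifying assumption, prime factors are counted with multiplicity and a is sifted by all primes below z. The function names and the value n = 100000 are hypothetical.

```python
def prime_factors(m):
    """Prime factors of m with multiplicity, by trial division."""
    factors, q = [], 2
    while q * q <= m:
        while m % q == 0:
            factors.append(q)
            m //= q
        q += 1
    if m > 1:
        factors.append(m)
    return factors


def primes_up_to(n):
    """Sieve of Eratosthenes: all primes <= n."""
    flags = bytearray([1]) * (n + 1)
    flags[0:2] = b"\x00\x00"
    for q in range(2, int(n ** 0.5) + 1):
        if flags[q]:
            flags[q * q::q] = bytearray(len(range(q * q, n + 1, q)))
    return [q for q in range(n + 1) if flags[q]]


def chen_weight(a, n):
    """Weight w(a) in the simplified model above, or None if a has a prime
    factor below z = n^(1/10)."""
    z, z1 = n ** 0.1, n ** (1 / 3)
    factors = prime_factors(a)
    if any(p < z for p in factors):
        return None
    medium = [p for p in factors if p < z1]     # prime factors in [z, z1)
    large = [p for p in factors if p >= z1]
    # delta = 1 exactly when a = p * p1 * p2 with p in [z, z1) and p1, p2 >= z1
    delta = 1 if (len(medium) == 1 and len(large) == 2) else 0
    return 1 - sum(0.5 + 0.5 * delta for _ in medium)


n = 100_000
survivors = {}
for p in primes_up_to(n - 1):
    if p > 2:
        a = n - p
        w = chen_weight(a, n)
        if w is not None and w > 0:
            survivors[a] = len(prime_factors(a))
print(len(survivors), max(survivors.values()))
```

In this model an element with two or more prime factors in [z, z_1) has weight at most 0, and an element of the form p p_1 p_2 has weight exactly 0, so every element with positive weight has at most two prime factors.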

4.5 Other sieve methods 

We conclude our discussion of sieve methods with a brief account of some of the important 
ideas in sieve theory left out of the previous sections. 

Selberg's sieve. The Rosser-Iwaniec sieve defined by (4.17) and (4.18) is not particularly
sensitive to the arithmetical nature of the sequence $\mathcal{A}$ that is being sifted. In fact, the
only piece of information about $\mathcal{A}$ that the Rosser-Iwaniec sieve does take into account
is its sieve dimension. Such sieves are known as combinatorial. Selberg [204] proposed
another approach, which uses the multiplicative function $\omega(d)$ appearing in (4.5) to construct
essentially best possible upper sieve weights $\lambda^+(d)$ for a given sequence $\mathcal{A}$.
Suppose that $\rho(d)$ is a real function such that $\rho(1) = 1$. Then

$\sum_{d \mid n} \mu(d) \le \Big( \sum_{d \mid n} \rho(d) \Big)^2$ for all $n = 1, 2, \ldots$.

We can apply this inequality to estimate $S(\mathcal{A}, \mathfrak{P}, z)$ as follows:

$S(\mathcal{A}, \mathfrak{P}, z) \le \sum_{a \in \mathcal{A}} \Big( \sum_{d \mid (a, P(z))} \rho(d) \Big)^2 = \sum_{d_1, d_2 \mid P(z)} \rho(d_1)\rho(d_2) |\mathcal{A}_{[d_1, d_2]}|,$

where $|\mathcal{A}_d|$ is as before and $[d_1, d_2]$ is the least common multiple of $d_1$ and $d_2$. Using (4.5),
we find that

$S(\mathcal{A}, \mathfrak{P}, z) \le X\mathcal{W} + \mathcal{R}',$

where

$\mathcal{W} = \sum_{d_1, d_2 \mid P(z)} \rho(d_1)\rho(d_2)\frac{\omega([d_1, d_2])}{[d_1, d_2]}, \qquad \mathcal{R}' = \sum_{d_1, d_2 \mid P(z)} \rho(d_1)\rho(d_2) r(X, [d_1, d_2]).$

In order to control the "error term" $\mathcal{R}'$, we further assume that $\rho(d) = 0$ when $d > \xi$, where
$\xi > 0$ is a parameter. The double sum $\mathcal{W}$ is a quadratic form in the variables $\rho(d)$, $1 < d \le \xi$.
Selberg's idea is to choose the values of these variables so as to minimize this quadratic form.

More information about Selberg's sieve, including the techniques used to construct the
lower sieve function $\lambda^-(d)$ of Selberg's sieve, can be found in [174] and in Selberg's
collected works [51151 l2Tffi] .

The large sieve. The method known as the large sieve was introduced in 1941 by Linnik
[140], but its systematic study did not commence until Rényi's work [195] on the binary
Goldbach problem. The original idea of Linnik and Rényi evolved into a general analytic
principle that has penetrated analytic number theory on many levels (and perhaps does not 
warrant the name "sieve" anymore, but the term has survived for historical reasons). The 
most prominent application of the large sieve is the Bombieri- Vinogradov theorem. The 
reader will find discussion of the number-theoretic aspects of the large sieve in [2m EH H71j 
and of the analytic side of the story in [HU 117H I172| . 

Alternative form of the error term in the sieve. Iwaniec [106] obtained a variant of
the linear sieve featuring an error term that is better suited for certain applications than the
error term $\mathcal{R}$ defined in (4.13). It is of the form

(4.34) $\sum_{m < M,\ m \mid P(z)} \ \sum_{n < N,\ n \mid P(z)} a_m b_n r(X, mn),$

where the coefficients $a_m$ and $b_n$ are bounded above in absolute value and $r(X, mn)$ are
the remainder terms defined earlier. In some applications, one can use the bilinearity of this
expression to estimate the double sum when the product $MN$ is larger than the largest value
of $y$ for which one can obtain a satisfactory bound for $\mathcal{R}$. Iwaniec [104] used this idea in his
proof that certain quadratic polynomials take on infinitely many $P_2$-numbers (recall §2.4).

Prime detecting sieves. For a long time it was believed that sieve methods are not
capable of detecting prime numbers; there are even a couple of prominent papers (see [2U
205J) that quantify the shortcomings of the classical sieve technology. In short, classical
sieves are incapable of distinguishing between integers having an even number of prime divisors
and those having an odd number of prime divisors (this is known in sieve theory as the parity
obstacle). A prime detecting sieve overcomes the parity obstacle by combining the general
sieve philosophy with additional analytic information. A variant of the basic idea can be
traced all the way back to Vinogradov's work on sums over primes, but the first explicit
uses of prime detecting sieves appeared in the late 1970s in investigations of the distribution
of primes in short intervals (see jHU 1107] ) . The method flourished during the last decade
and has been instrumental in the proofs of several of the results mentioned in the previous
sections: the result of Friedlander and Iwaniec [SS] on prime values of $x^2 + y^4$; the results
of Heath-Brown and Moroz |H9*1 19*2*] on prime values of binary cubic forms; and the result
of Baker, Harman, and Pintz [8] on primes in short intervals are just three such examples.
Compared to classical sieve methods, the theory of prime detecting sieves is still in its infancy
and thus the general literature on the subject is relatively scarce, but the reader eager to
learn more about such matters will find two excellent expositions in and [75] .

5 Other work on the Waring-Goldbach problem

In the Introduction, we mentioned the cornerstones in the study of the Goldbach and
Waring-Goldbach problems. However, as is often the case in mathematics, those results are
intertwined with a myriad of other results on various aspects and variants of the two main
problems. In this final section, we describe some of the more important results of the latter
kind. The circle method, sieve methods, or a combination of them play an essential role in
the proofs of all of them.

5.1 Estimates for exceptional sets 

Inspired by the work of Chudakov [32] and Estermann [54] on the exceptional set in the
binary Goldbach problem, Hua studied the function $h(k)$, defined to be the least $s$ such that
almost all integers $n \le x$, $n \equiv s \pmod{K(k)}$, can be written as the sum of $s$ $k$th powers of
primes ($K(k)$ is defined by (1.12)). Let $E_{k,s}(x)$ denote the number of exceptions, that is, the
number of integers $n$, with $n \le x$ and $n \equiv s \pmod{K(k)}$, for which (1.14) has no solution
in primes $p_1, \ldots, p_s$. Hua showed (essentially) that if $H(k) \le s_0(k)$, then $E_{k,s}(x) = o(x)$ for
any $s \ge \frac{1}{2}s_0(k)$. Later, Schwarz [202] refined Hua's method to show that

(5.1) $E_{k,s}(x) \ll x(\log x)^{-A}$

for any fixed $A > 0$.

In recent years, motivated by the estimate (1.6) of Montgomery and Vaughan, several
authors have pursued similar estimates for exceptional sets for squares and higher powers of
primes. The first to obtain such an estimate were Leung and Liu [134], who showed that
$E_{2,3}(x) \ll x^{1-\delta}$, with an absolute constant $\delta > 0$. Explicit versions of this result were later
given in [TBJ [HOI 11281 H591 H6Uj , the best result to date being the estimate (see Harman and
Kumchev [80])

$E_{2,3}(x) \ll x^{6/7+\varepsilon}.$

Furthermore, several authors [80, 147, 149, 155, 249] obtained improvements on Hua's bound
(5.1) for $E_{2,4}(x)$, the most recent being the bound

$E_{2,4}(x) \ll x^{5/14+\varepsilon},$

established by Harman and Kumchev [80]. Ren [194] studied the exceptional set for sums of
five cubes of primes and proved that

$E_{3,s}(x) \ll x^{1-(s-4)/153} \qquad (5 \le s \le 8).$

This estimate has since been improved by Wooley [248] and Kumchev [126]. In particular,
Kumchev [126] showed that

$E_{3,5}(x) \ll x^{79/84}, \qquad E_{3,6}(x) \ll x^{31/35},$
$E_{3,7}(x) \ll x^{51/84}, \qquad E_{3,8}(x) \ll x^{23/84}.$

Finally, Kumchev [126] has developed the necessary machinery to obtain estimates of the
form $E_{k,s}(x) \ll x^{1-\delta}$, with explicit values of $\delta = \delta(k, s) > 0$, for all pairs of integers $k \ge 4$
and $s$ for which an estimate of the form (5.1) is known.

In 1973 Ramachandra [192] considered the exceptional set for the binary Goldbach problem
in short intervals. He proved that if $y \ge x^{7/12+\varepsilon}$ and $A > 0$, then

$E(x + y) - E(x) \ll y(\log x)^{-A},$

where the implied constant depends only on $A$ and $\varepsilon$. After a series of improvements on this
result E2 E3 HIB HIS HIS CS3 IMl CHS, this estimate is now known for $y \ge x^{7/108+\varepsilon}$
(see Jia [115]). Lou and Yao [164, 250] were the first to pursue a short interval version of
the estimate (1.6) of Montgomery and Vaughan. Their result was substantially improved by
Peneva [180], and the best result in this direction, due to Languasco [131], states that there
exists a small constant $\delta > 0$ such that

$E(x + y) - E(x) \ll y^{1-\delta/600},$

whenever $y \ge x^{7/24+7\delta}$.

Furthermore, J. Liu and Zhan [157] and Mikawa [169] studied the quantity $E_{2,3}(x)$ in
short intervals and the latter author showed that

$E_{2,3}(x + y) - E_{2,3}(x) \ll y(\log x)^{-A}$

for any fixed $A > 0$ and any $y \ge x^{1/2+\varepsilon}$.

5.2 The Waring-Goldbach problem with almost primes

There have also been attempts to gain further knowledge about the Waring-Goldbach problem
by studying closely related but more accessible problems. The most common such variants
relax the multiplicative constraint on (some of) the variables. Consider, for example,
Lagrange's equation

(5.2) $x_1^2 + x_2^2 + x_3^2 + x_4^2 = n.$

Greaves [62] proved that every sufficiently large $n \not\equiv 0, 1, 5 \pmod{8}$ can be represented in
the form (5.2) with $x_1, x_2$ primes and $x_3, x_4$ (unrestricted) integers. Later, Plaksin [189] and
Shields [207] found independently an asymptotic formula for the number of such representations.
Brüdern and Fouvry [23] proved that every sufficiently large integer $n \equiv 4 \pmod{24}$
can be written as the sum of four squares of $P_{34}$-numbers. Heath-Brown and Tolev [53]
established, under the same hypothesis on $n$, that one can solve (5.2) in one prime and three
almost primes of type $P_{101}$ or in four almost primes, each of type $P_{25}$. Tolev [218] has recently
improved the results of Heath-Brown and Tolev, replacing the types of the almost primes involved by $P_{80}$ and
$P_{21}$, respectively. We must also mention the recent result by Blomer and Brüdern J7j that
all sufficiently large integers $n$ such that $n \equiv 3 \pmod{24}$ and $5 \nmid n$ are sums of three squares of almost
primes of type $P_{521}$ (and of type $P_{371}$ if $n$ is also squarefree).

In 1951 Roth [200] proved that if $n$ is sufficiently large, the equation

(5.3) $x^3 + p_1^3 + \cdots + p_7^3 = n$

has solutions in primes $p_1, \ldots, p_7$ and an integer $x$. Brüdern [22] showed that if $n \equiv
4 \pmod{18}$, then $x$ can be taken to be a $P_4$-number, and Kawada [118] used an idea from
Chen's proof of Theorem 2 to obtain a variant of Brüdern's result for almost primes of type
$P_3$. Furthermore, Brüdern j2H] proved that every sufficiently large integer is the sum of the
cubes of a prime and six almost-primes (five $P_5$-numbers and a $P_{69}$-number) and Kawada
[119] has shown that every sufficiently large integer is the sum of seven cubes of $P_4$-numbers.

Wooley [23^ showed that all but $O((\log x)^{6+\varepsilon})$ integers $n \le x$ satisfying certain natural
congruence conditions can be represented in the form (5.2) with prime variables $x_1, x_2, x_3$
and an integer $x_4$. Tolev [219] established a result of similar strength for the exceptional set
for equation (5.2) with primes $x_1, x_2, x_3$ and an almost prime $x_4$ of type $P_{11}$.

5.3 The Waring-Goldbach problem with restricted variables

Through the years, a number of authors have studied variants of the Goldbach and
Waring-Goldbach problems with additional restrictions on the variables. In 1951 Haselgrove [82]
announced that every sufficiently large odd integer $n$ is the sum of three primes $p_1, p_2, p_3$
such that $|p_i - n/3| \le n^{63/64+\varepsilon}$. In other words, one can take the primes in Vinogradov's
three prime theorem to be "almost equal". Subsequent work by several mathematicians
0EHin3[Iini[nZl[2n3] tightened the range for the $p_i$'s to $|p_i - n/3| \le n^{4/7}$ (see Baker and
Harman [7]).

Furthermore, Bauer, Liu, and Zhan [TB*, 156, 158] considered the problem of representations
of an integer as sums of five squares of almost equal primes. The best result to date is
due to Liu and Zhan [158], who proved that every sufficiently large integer $n \equiv 5 \pmod{24}$
can be written as

$n = p_1^2 + \cdots + p_5^2,$

with primes $p_1, \ldots, p_5$ satisfying $|p_j^2 - n/5| < n^{45/46+\varepsilon}$. Liu and Zhan [156] also showed that
the exponent $45/46$ can be replaced by $19/20$ on the assumption of GRH.

In 1986 Wirsing [244] proved that there exist sparse sequences of primes $S$ such that
every sufficiently large odd integer can be represented as the sum of three primes from $S$.
However, his method was probabilistic and did not yield an example of such a sequence.
Thus, Wirsing proposed the problem of finding "natural" examples of arithmetic sequences
having this property. The first explicit example was given by Balog and Friedlander [12].
They proved that the sequence of Piatetski-Shapiro primes (recall (2.13)) is admissible for
$1 < c < 21/20$. Jia [113] improved the range for $c$ to $1 < c < 16/15$, and Peneva [181]
studied the binary problem with a Piatetski-Shapiro prime and an almost prime. Tolev [215]-[217]
and Peneva [179] considered additive problems with prime variables $p$ such that the
integers $p + 2$ are almost-primes. For example, Tolev [217] proved that every sufficiently
large $n \equiv 3 \pmod{6}$ can be represented as the sum of primes $p_1, p_2, p_3$ such that $p_1 + 2 = P_2$,
$p_2 + 2 = P_5$, and $p_3 + 2 = P_7$. Green and Tao announced at the end of [Hlj that, using their
method, one can prove that there are arbitrarily long nontrivial arithmetic progressions
consisting of primes $p$ such that $p + 2 = P_2$. They presented in jHH] a proof of this result for
progressions of three primes.

5.4 Linnik's problem and variants 

In the early 1950s Linnik proposed the problem of finding sparse sequences $\mathcal{A}$ such that all
sufficiently large integers $n$ (possibly subject to some parity condition) can be represented
as sums of two primes and an element of $\mathcal{A}$. He considered two special sequences. First,
he showed [143] that if GRH holds, then every sufficiently large odd $n$ is the sum of three
primes $p_1, p_2, p_3$ with $p_1 \ll (\log n)^3$. Montgomery and Vaughan [173] sharpened the bound
on $p_1$ to $p_1 \ll (\log n)^2$ and also obtained an unconditional result with $p_1 \ll n^{7/72+\varepsilon}$; the
latter bound has been subsequently improved to $p_1 \ll n^{0.02625}$ (this follows by the original
argument of Montgomery and Vaughan from recent results of Baker, Harman, and Pintz [8]
and Jia [114]).

Linnik [142, 144] was also the first to study additive representations as sums of two primes
and a fixed number of powers of 2. He proved, first under GRH and later unconditionally,
that there is an absolute constant $r$ such that every sufficiently large even integer $n$ can be
expressed as the sum of two primes and $r$ powers of 2, that is, the equation

$p_1 + p_2 + 2^{\nu_1} + \cdots + 2^{\nu_r} = n$

has solutions in primes $p_1, p_2$ and non-negative integers $\nu_1, \ldots, \nu_r$. Later Gallagher [SU]
established the same result by a different method. Several authors have used Gallagher's
approach to find explicit values of the constant $r$ above (see [137, 138, 150, 151, 152, 242]):
in particular, Li [138] proved that $r = 1906$ is admissible and Wang [242] obtained $r =
160$ under GRH. Recently, Heath-Brown and Puchta [93] and Pintz and Ruzsa [187] made
(independently) an important discovery that leads to a substantial improvement on the
earlier results. Their device establishes Linnik's result with $r = 13$ (see [93]) and with
$r = 7$ under GRH (see [93, 187]). Furthermore, Pintz and Ruzsa [188] have announced an
unconditional proof of the case $r = 8$.

There is a similar approximation to the Waring-Goldbach problem for four squares of
primes. J. Y. Liu, M. C. Liu, and Zhan [153, 154] proved that there exists a constant $r$ such
that every sufficiently large even integer $n$ can be expressed in the form

$p_1^2 + p_2^2 + p_3^2 + p_4^2 + 2^{\nu_1} + \cdots + 2^{\nu_r} = n,$

where $p_1, \ldots, p_4$ are primes and $\nu_1, \ldots, \nu_r$ are non-negative integers. J. Y. Liu and M. C. Liu
[148] established this result with $r = 8330$ and considered also the related problem about
representations of integers as sums of a prime, two squares of primes and several powers of
2.

5.5 Additive problems with mixed powers 

In 1923 Hardy and Littlewood [71] used the general philosophy underlying the circle method
to formulate several interesting conjectures. For example, they stated a conjectural asymptotic
formula for the number of representations of a large integer $n$ in the form

(5.4) $p + x^2 + y^2 = n,$

where $p$ is a prime and $x, y$ are integers. Their prediction was confirmed in the late 1950s,
first by Hooley [H7J under the assumption of GRH and then unconditionally by Linnik [145].
The reader will find the details of the proof in [9*Hl I146j .

In another conjecture, Hardy and Littlewood proposed an asymptotic formula for the
number of representations of a large integer $n$ as the sum of a prime and a square. While
such a result appears to lie beyond the reach of present methods, Miech [166] showed that
this conjecture holds for almost all integers $n \le x$. Let $E_k(x)$, $k \ge 2$, denote the number of
integers $n \le x$ such that the equation $n = p + x^k$ has no solution in a prime $p$ and an integer
$x$. Miech obtained the bound $E_2(x) \ll x(\log x)^{-A}$ for any fixed $A > 0$. Subsequent work
of Brüdern, Brünner, Languasco, Mikawa, Perelli, Pintz, Polyakov, A. I. Vinogradov, and
Zaccagnini |2EJ 123 1131 EEHE1 HE31 EEI3 I2SS1 12E0 extended and sharpened Miech's estimate
considerably. Here is a list of some of their results:

• For any fixed $k \ge 2$, we have $E_k(x) \ll x^{1-\delta_k}$, where $\delta_k > 0$ depends at most on $k$; see
[2H1 IM3 ESS! for the case $k = 2$ and UBS EH] for the general case.

• Assuming GRH, we have $E_k(x) \ll x^{1-\delta_k}$, where $\delta_k = 1/(k2^k)$ or $\delta_k = 1/(25k)$ according
as $2 \le k \le 4$ or $k \ge 5$; see (TEH] and

• If $k \ge 2$ is a fixed integer and $K = 2^{k-2}$ then there exists a small absolute constant
$\delta > 0$ such that

$E_k(x + y) - E_k(x) \ll y^{1-\delta/K},$

provided that $x^{(7/12)(1-1/k)+\delta} \le y \le x$; see p^j .

Furthermore, several mathematicians P^ITo^l^lll57j obtained variants of the above bounds
in the case when the variable $x$ is also restricted to primes, while Zaccagnini [252] studied
the more general problem of representing a large integer $n$ in the form $n = p + f(x)$, where
$f(x) \in \mathbb{Z}[x]$.

Several interesting theorems were proved by Brüdern and Kawada . For example, one
of them states that if $k$ is an integer with $3 \le k \le 5$, then all sufficiently large integers $n$
can be represented as

$x + p_1^2 + p_2^3 + p_3^k = n,$

where $p_i$ are primes and $x = P_2$.

5.6 The Waring— Goldbach problem "with coefficients" 

In this section we discuss the solubility of equations of the form which we intro- 

duced in $ 11.41 as a natural generalization of the Waring-Goldbach problem. There are two 
substantially different contexts in which one can study this problem. Suppose first that 
all ai, . . . , a s , n are all of the same sign. Then one expects that ()1.16|) must have solutions 
for sufficiently large \n\. When "sufficiently large" is understood as |n| > C(ai, . . . , a s ), 
with some unspecified constant depending on the aj's, this is a trivial modification of the 
Waring-Goldbach problem (that can be handled using essentially the same tools). On 
the other hand, the problem of finding solution when \n\ is not too large compared to 
|a|oo = max{|ai|, . . . , \a s \} is significantly more challanging. Similarly, if ai, . . . , a s are not 
all of the same sign, one wants to find solutions of (|1.16J) in primes pi, . . . ,p s that are not 
too large compared to \sl\oo and |n|. Such questions were investigated first by Baker j^j, who 
studied the case k = 1 and s = 3. Later, Liu and Tsang showed, again for k = 1 and 
s = 3, that (jl.l6p has solutions when: 

• ai, a 2 , d3 are of the same sign and |n| ^> |a|^ for some absolute constant A > 0; 

• ai,a 2 ,a 3 are not of the same sign and max{p!,p 2 ,P3} (a^" 1 + |n|. 

In these results, the coefficients $a_1, a_2, a_3, n$ must also satisfy certain necessary congruence
conditions (which generalize the requirement that $n$ be odd in Vinogradov's three primes
theorem). Through the efforts of several mathematicians, the constant $A$ has been evaluated
and it is known that the value $A = 38$ is admissible (see Li [139]). Furthermore, if we replace
the natural arithmetic conditions on the coefficients by another set of conditions, which are
somewhat more restrictive but also greatly simplify the analysis, we can decrease the value
of $A$ further. In particular, Choi and Kumchev have shown that $A = 23/3$ is admissible
under such stronger hypotheses.
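
For small coefficients, the linear case $k = 1$, $s = 3$ of (1.16) can be explored numerically. The following Python sketch is a brute-force illustration only and is not the method of the papers cited above. The names `primes_up_to` and `solve_linear`, as well as the search bound, are our own illustrative choices, and the coefficients are assumed to be nonzero integers.

```python
from math import isqrt


def primes_up_to(limit):
    """Return all primes <= limit using the sieve of Eratosthenes."""
    if limit < 2:
        return []
    sieve = bytearray([1]) * (limit + 1)
    sieve[0] = sieve[1] = 0
    for i in range(2, isqrt(limit) + 1):
        if sieve[i]:
            # Cross out the multiples of i, starting from i*i.
            sieve[i * i :: i] = bytes(len(range(i * i, limit + 1, i)))
    return [i for i, is_p in enumerate(sieve) if is_p]


def solve_linear(a, n, bound):
    """Find primes p1, p2, p3 <= bound with a1*p1 + a2*p2 + a3*p3 = n.

    Returns one solution as a tuple, or None if none exists below the bound.
    The coefficients in `a` are assumed to be nonzero integers.
    """
    a1, a2, a3 = a
    primes = primes_up_to(bound)
    prime_set = set(primes)
    for p1 in primes:
        for p2 in primes:
            rest = n - a1 * p1 - a2 * p2  # this must equal a3 * p3
            if rest % a3 == 0 and rest // a3 in prime_set:
                return p1, p2, rest // a3
    return None


if __name__ == "__main__":
    print(solve_linear((1, 2, 3), 20, 20))   # coefficients of the same sign
    print(solve_linear((1, 1, -1), 1, 20))   # coefficients of mixed signs
    print(solve_linear((2, 2, 2), 7, 30))    # parity obstruction: None
```

The last call shows the role of the congruence conditions: an even combination can never equal the odd number 7, so no bound on the primes helps.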

Liu and Tsang also studied the quadratic case of (1.16) in five variables and obtained
results similar to those stated above for the linear case. In this problem, explicit values of
the analogue of $A$ above were given by Choi and Liu, Choi and Kumchev [37], and
Harman and Kumchev. In particular, Harman and Kumchev proved that (1.16) with $k = 2$ and
$s = 5$ has solutions when:
• $a_1, \dots, a_5$ are of the same sign and $|n| \gg |\mathbf{a}|_\infty^{[\ldots]+\varepsilon}$;

• $a_1, \dots, a_5$ are not of the same sign and $\max\{p_1, \dots, p_5\} \ll |\mathbf{a}|_\infty^{[\ldots]+\varepsilon} + |n|^{1/2}$.

5.7 Diophantine inequalities with primes

Some variants of the Waring-Goldbach problem are stated most naturally in terms of diophantine
inequalities. The best-known problem of this kind concerns the distribution of the
values of the forms

(5.5)    $\lambda_1 p_1^k + \dots + \lambda_s p_s^k$,

where $k$ and $s$ are positive integers, $\lambda_1, \dots, \lambda_s$ are nonzero real numbers, and $p_1, \dots, p_s$ are
prime variables. It is natural to conjecture that if $\lambda_1, \dots, \lambda_s$ are not all of the same sign
and if $\lambda_i/\lambda_j$ is irrational for some pair of indices $i, j$, then the values attained by the form
(5.5) are dense in $\mathbb{R}$ whenever $s \ge s_0(k)$. In other words, given any $\varepsilon > 0$ and $\alpha \in \mathbb{R}$, the
inequality

(5.6)    $|\lambda_1 p_1^k + \dots + \lambda_s p_s^k - \alpha| < \varepsilon$

should have a solution in primes $p_1, \dots, p_s$. The first results in this problem were obtained
by Schwarz [203], who established the solvability of (5.6) under the same restrictions on
$s$ as in Theorem El. Baker [5] and Vaughan [221, 222, 224] proposed the more difficult
problem of replacing the fixed number $\varepsilon$ on the right side of (5.6) by an explicit function
of $\max\{p_1, \dots, p_s\}$ that approaches $0$ as $\max\{p_1, \dots, p_s\} \to \infty$. Further work has focused
primarily on the case of small $k$. For example, Harman [75] has shown that under the above
assumptions on $\lambda_1, \lambda_2, \lambda_3$, the diophantine inequality

$|\lambda_1 p_1 + \lambda_2 p_2 + \lambda_3 p_3 - \alpha| < \max\{p_1, p_2, p_3\}^{-1/5+\varepsilon}$

has infinitely many solutions in primes $p_1, p_2, p_3$. Baker and Harman showed that on
GRH the exponent $1/5$ in this result can be replaced by $1/4$. Furthermore, Harman [77] proved
that if $\lambda_1/\lambda_2$ is a negative irrational number, then for any real $\alpha$ the inequality

$|\lambda_1 p + \lambda_2 P_3 - \alpha| < p^{-1/300}$

has infinitely many solutions in a prime $p$ and a $P_3$-almost prime. (This improves on an
earlier result of Vaughan [224], where the almost prime is a $P_4$-number.)
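
To get a feeling for inequalities of this type, one can search numerically for the prime triple that brings a linear form closest to a target. The sketch below is a naive exhaustive search in floating-point arithmetic and has nothing to do with the analytic methods behind the theorems quoted above. The function name `best_approximation` and the sample values $\lambda = (1, \sqrt{2}, -\sqrt{3})$, $\alpha = 1/2$ are hypothetical choices made for illustration; the bound is assumed to be at least 2.

```python
from itertools import product
from math import isqrt, sqrt


def primes_up_to(limit):
    """Return all primes <= limit using the sieve of Eratosthenes."""
    if limit < 2:
        return []
    sieve = bytearray([1]) * (limit + 1)
    sieve[0] = sieve[1] = 0
    for i in range(2, isqrt(limit) + 1):
        if sieve[i]:
            sieve[i * i :: i] = bytes(len(range(i * i, limit + 1, i)))
    return [i for i, is_p in enumerate(sieve) if is_p]


def best_approximation(lams, alpha, bound):
    """Minimize |l1*p1 + l2*p2 + l3*p3 - alpha| over primes p1, p2, p3 <= bound.

    Returns a pair (error, (p1, p2, p3)). Assumes bound >= 2.
    """
    l1, l2, l3 = lams
    primes = primes_up_to(bound)
    return min(
        (abs(l1 * p1 + l2 * p2 + l3 * p3 - alpha), (p1, p2, p3))
        for p1, p2, p3 in product(primes, repeat=3)
    )


if __name__ == "__main__":
    # Coefficients of mixed signs with irrational ratios (illustrative values).
    lams = (1.0, sqrt(2), -sqrt(3))
    for bound in (10, 30, 100):
        err, triple = best_approximation(lams, 0.5, bound)
        print(bound, triple, err)
```

Since the search space grows with the bound, the reported error can only decrease as the bound increases; how fast it decreases is exactly what results of the above kind quantify.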

In 1952 Piatetski-Shapiro [184] considered a variant of the Waring-Goldbach problem for
non-integer exponents $c > 1$. He showed that for any fixed $c > 1$ that is not an integer,
there exists an integer $H(c)$ with the following property: if $s \ge H(c)$, the inequality

(5.7)    $|p_1^c + \dots + p_s^c - \alpha| < \varepsilon$

has solutions in primes $p_1, \dots, p_s$ for any fixed $\varepsilon > 0$ and $\alpha > \alpha_0(\varepsilon, c)$. In particular,
Piatetski-Shapiro showed that $H(c) \le 5$ for $1 < c < 3/2$. Motivated by Vinogradov's three
primes theorem, Tolev [213] proved that $H(c) \le 3$ for $1 < c < 15/14$. The range of validity of
Tolev's result was subsequently extended by several authors; in particular,
Kumchev [125] has given the range $1 < c < 61/55$. Furthermore, it follows from the work
of Kumchev and Laporta [129, 133] that $H(c) \le 4$ for $1 < c < 6/5$ and for almost all (in
the sense of Lebesgue measure) $1 < c < 2$, while Garaev has shown that $H(c) \le 5$ for
$1 < c < (1 + \sqrt{5})/2 = 1.61\ldots$. Finally, Tolev and Zhai have studied systems of
inequalities of the form (5.7).

Several authors have studied variants of Goldbach's problem suggested by
results about additive inequalities. For example, Arkhipov, Chen, and Chubarikov [2] proved
that if $\lambda_1/\lambda_2$ is an algebraic irrationality, then all but $O(x^{2/3+\varepsilon})$ positive integers $n \le x$ can
be represented in the form

$[\lambda_1 p_1] + [\lambda_2 p_2] = n$,

where $p_1, p_2$ are primes.
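
The exceptional set in a statement of this kind is easy to tabulate for small $x$. The following Python sketch lists the integers $n \le x$ that are not of the form $[\lambda_1 p_1] + [\lambda_2 p_2]$. It assumes, as a simplification of ours, that $\lambda_1$ and $\lambda_2$ are positive, and it relies on floating-point arithmetic, so it is meant only for small experiments; the function name `exceptions` and the sample values $\sqrt{2}$, $\sqrt{3}$ are illustrative.

```python
from math import floor, isqrt, sqrt


def primes_up_to(limit):
    """Return all primes <= limit using the sieve of Eratosthenes."""
    if limit < 2:
        return []
    sieve = bytearray([1]) * (limit + 1)
    sieve[0] = sieve[1] = 0
    for i in range(2, isqrt(limit) + 1):
        if sieve[i]:
            sieve[i * i :: i] = bytes(len(range(i * i, limit + 1, i)))
    return [i for i, is_p in enumerate(sieve) if is_p]


def exceptions(lam1, lam2, x):
    """List 1 <= n <= x with no representation n = [lam1*p1] + [lam2*p2].

    Here [t] is the integer part, p1 and p2 are primes, and lam1, lam2 > 0.
    """
    # If [lam*p] <= x, then p < (x + 1) / lam, so this limit is large enough.
    limit = int((x + 1) / min(lam1, lam2)) + 1
    primes = primes_up_to(limit)
    first = {floor(lam1 * p) for p in primes}
    second = {floor(lam2 * p) for p in primes}
    representable = {u + v for u in first for v in second if u + v <= x}
    return [n for n in range(1, x + 1) if n not in representable]


if __name__ == "__main__":
    # sqrt(2)/sqrt(3) is an algebraic irrationality.
    for x in (10, 100, 1000):
        print(x, len(exceptions(sqrt(2), sqrt(3), x)))
```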



6 A new path: arithmetic progressions of primes

Finally, we should say a few words about the astonishing result of Green and Tao on
the existence of arbitrarily long arithmetic progressions of prime numbers. They deduce the
existence of such arithmetic progressions from a generalization of a celebrated theorem of
Szemerédi [209, 210], which is itself a deep result in combinatorial number theory. Let $A$ be
a set of positive integers with positive upper density, that is,

$\delta(A) = \limsup_{N \to \infty} \frac{\#\{n \in A : n \le N\}}{N} > 0$.

In its original, most basic form, Szemerédi's theorem asserts that such a set $A$ contains an
arithmetic progression of length $k$ for all integers $k \ge 3$. From this basic statement, Green
and Tao deduce the following more general result.

Theorem 9 (Szemerédi's theorem for pseudorandom measures). Let $\delta \in (0, 1]$ be a
fixed real number, let $k \ge 3$ be a fixed integer, and let $N$ be a large prime. Suppose that $\nu$ is a
"$k$-pseudorandom measure"⁸ on $\mathbb{Z}_N = \mathbb{Z}/N\mathbb{Z}$ and $f : \mathbb{Z}_N \to [0, \infty)$ is a function satisfying

(6.1)    $0 \le f(x) \le \nu(x)$ for all $x \in \mathbb{Z}_N$

and

$\sum_{x \in \mathbb{Z}_N} f(x) \ge \delta N$.

Then

(6.2)    $\sum_{x \in \mathbb{Z}_N} \sum_{r \in \mathbb{Z}_N} f(x) f(x + r) \cdots f(x + (k - 1) r) \gg N^2$,

the implied constant depending at most on $\delta$ and $k$.

⁸ A $k$-pseudorandom measure on $\mathbb{Z}_N$ is a non-negative function on $\mathbb{Z}_N$ whose average over $\mathbb{Z}_N$ is close to
1 and which is subject to a couple of additional constraints that are too technical to state here. See the paper of
Green and Tao for details.

To relate this result to the version of Szemerédi's theorem stated earlier, consider the case
where $\nu(x) = 1$ for all $x$ (this is a $k$-pseudorandom measure) and $f(x)$ is the characteristic
function of the set $A_N = A \cap [1, N]$ considered as a subset of $\mathbb{Z}_N$. Then the left side of (6.2)
counts (essentially) the $k$-term arithmetic progressions in the set $A_N$ (the majority of which
are also $k$-term arithmetic progressions in $A_N \subset \mathbb{Z}$).

To derive the result on arithmetic progressions of primes, Green and Tao take $f(x)$ to be a
function which, in some sense (see their paper for details), approximates the characteristic function
of the primes in the interval $[c_1 N, c_2 N]$, where $0 < c_1 < c_2 < 1$ are suitable constants.
Then they construct a pseudorandom measure $\nu(x)$ such that (6.1) holds. This leads to the
following theorem.

Theorem 10 (Green and Tao, 2004). Let $k \ge 3$ and let $A$ be a set of prime numbers
such that

$\limsup_{N \to \infty} \frac{\#\{n \in A : n \le N\}}{\pi(N)} > 0$.

Then $A$ contains infinitely many $k$-term arithmetic progressions. In particular, there are
infinitely many $k$-term arithmetic progressions of prime numbers.

We remark that the infinitude of the $k$-term progressions of primes is a consequence of
(6.2). In fact, using the explicit form of the function $f(x)$ to which they apply Theorem 9,
Green and Tao establish the existence of $\gg N^2 (\log N)^{-k}$ $k$-term progressions within
$A \cap [1, N]$.
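
For small $N$ and $k$, progressions of primes can simply be enumerated. The Python sketch below does this by exhaustive search and compares the count with $N^2 (\log N)^{-k}$. It is a numerical illustration under our own conventions (common difference $r \ge 1$, all terms at most $N$, hypothetical function name `prime_progressions`) and is unrelated to the proof of Green and Tao.

```python
from math import isqrt, log


def primes_up_to(limit):
    """Return all primes <= limit using the sieve of Eratosthenes."""
    if limit < 2:
        return []
    sieve = bytearray([1]) * (limit + 1)
    sieve[0] = sieve[1] = 0
    for i in range(2, isqrt(limit) + 1):
        if sieve[i]:
            sieve[i * i :: i] = bytes(len(range(i * i, limit + 1, i)))
    return [i for i, is_p in enumerate(sieve) if is_p]


def prime_progressions(n_max, k):
    """List all k-term progressions p, p+r, ..., p+(k-1)r of primes <= n_max.

    Only common differences r >= 1 are considered, and k must be at least 2.
    """
    if k < 2:
        raise ValueError("k must be at least 2")
    primes = primes_up_to(n_max)
    prime_set = set(primes)
    found = []
    for p in primes:
        r = 1
        while p + (k - 1) * r <= n_max:
            terms = tuple(p + j * r for j in range(k))
            if all(t in prime_set for t in terms):
                found.append(terms)
            r += 1
    return found


if __name__ == "__main__":
    for n_max in (100, 1000):
        for k in (3, 4, 5):
            count = len(prime_progressions(n_max, k))
            print(n_max, k, count, n_max**2 / log(n_max) ** k)
```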

Several other interesting results are announced in 12]. For example, one of them asserts
that there are infinitely many progressions of primes $p_1, \dots, p_k$ such that each $p_i + 2$ is a
$P_2$-number (a proof of this result in the case $k = 3$ is presented in [65]).

Conclusion. With this, our survey comes to a close. We tried to describe the central problems
and the main directions of research in the additive theory of prime numbers and to introduce 
the reader to the classical methods. Complete success in such an undertaking is perhaps an 
impossibility, but hopefully we have been able to paint a representative picture of the current 
state of the subject and to motivate the reader to seek more information from the literature. 
Maybe some of our readers will one day join the ranks of the number theorists trying to turn 
the great conjectures mentioned above into beautiful theorems! 